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Dwf7 Mutants 

Cross-Reference to Related Application 
This application is related to provisional patent application serial no. 60/179,901, 
filed February 2, 2000, from which priority is claimed under 35 USC §1 19(e)(1) and 
which is incorporated herein by reference in its entirety. 

Technical Field 

The present invention relates generally to plants that display altered structure or 
morphology and to the genes imparting such pheontypes. In particular, the present 
invention pertains to Dwarf? (dwf7) mutants and methods of using the same. 

Background of the Invention 

Sterols are known to play at least two critical roles in plants: as bulk components 
of membranes regulating stability and permeability (Bach et al (1997) Prog. Lipid Res. 
36:197-226) and as precursors of growth-promoting brassinosteroids (BRs; Fujioka and 
Sakurai (1997) Nat. Prod. Rep. 14:1-10). Lesions in brassinosteroid (BR) biosynthetic 
genes result in characteristic dwarf phenotypes in plants. Understanding the regulation of 
BR biosynthesis demands continued isolation and characterization of mutants 
corresponding to the genes involved in BR biosynthesis. 

Sterol biosynthesis in plants has been studied extensively through enzyme 
purification or gene cloning (Grunwald (1975) Annu. Rev. Plant Physiol. 26:209-236; 
Goodwin (1979) Annu. Rev. Plant Physiol. 30:369-404; Benveniste (1986) Annu. Rev. 
Plant Physiol. 37:275-308; Bach and Benveniste (1997) Prog. Lipid Res. 36:197-226). 
Figure 1 shows the proposed biosynthetic pathway from squalene to brassinolide (BL). A 
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major difference between photosynthetic and nonphotosynthetic organisms is that 
cyclization of squalene 2,3 -oxide is bifurcated to a different route for each system 
(Benveniste (1986) Annu. Rev. Plant Physiol. 37:275-308). In animals and yeast, 
squalene 2,3-oxide is cyclized to lanosterol, whereas in photosynthetic organisms it is 
5 cyclized to cycloartenol (Nes and McKean (1977) Biochemistry of Steroids and Other 
Isopentenoids. (Baltimore, MD: University Park Press)). Accordingly, photosynthetic 
organisms require somewhat different biosynthetic enzymes, such as cycloartenol 
synthase (Corey et al. (1993) Proc. Natl. Acad. Sci. USA 90:11628-11632) and 
cycloeucalenol-obtusifoliol isomerase, which are required to open the cyclopropane ring 
10 in cycloartenol (Figure 1). However, most of the enzymatic steps are shared between the 
two different pathways. 

In plants, sterols are subject to a series of modifications before conversion to BL. 
Different sterols, such as 24-methylenecholesterol (24-MC), campesterol (CR), 
isofucosterol, and sitosterol, are converted to the BL congeners dolicholide, BL, 
1 5 28-homodolicholide, and 28-homoBL, respectively, in a species-specific manner (Fujioka 
et aL (1997) Plant Cell 9:1951-1962; Sasse (1997) Physiol. Plant 100:696-701). The 
BR-specific pathway diverges into the early and the late C-6 oxidation pathways. In the 
early C-6 oxidation pathway, introduction of a 6-oxo group occurs before the vicinal 
hydroxylation reactions at the side chain, whereas it occurs after these hydroxylations in 
20 the late C-6 oxidation pathway (Figure 1; Choi et al (1997) Phytochemistry 44:609-613). 

Several mutants, such as constitutive photomorphogenesis and dwarfism (cpd), 
deetiolatedl (det2\ and dwarf4 (dwf4\ have been shown to be defective in the 
BR-specific pathway (Li et al. (1996) Science 272:398-401; Li et al. (1997) Proc. Natl. 
Acad. Sci. USA 94:3554-3559; Szekeres et al. (1996) Cell 85:171-182; Choe et al. (1998) 
25 Plant Cell 10:231-243). These BR biosynthetic dwarfs share a characteristic dwarf 

phenotype, which includes short robust stems, reduced fertility, prolonged life cycle, and 
dark-green, round, and curled leaves when grown in the light. In the dark, these mutants 
exhibit short hypocotyls and expanded cotyledons, cpd (dwf3) mutants are only rescued 
by 23a-hydroxylated compounds (Szekeres et al. (1996) Cell 85:171-182). The CPD 
30 gene was shown to encode a cytochrome P450 steroid hydroxylating enzyme 
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(CYP90A1). In addition, Li et al. (1996) Science 272:398-401 and Li et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:3554-3559 showed that det2/dwf6 is blocked in the C-5 
reduction step. DET2 was found to be homologous to steroid 5a-reductases. Like its 
animal equivalents, DET2 successfully converted progesterone (3-oxo-A 4,5 steroid) to 
5 4,5-dihydroprogesterone in a human cell line. In addition, the human 5a-reductase gene 
effectively complemented det2 mutants (Li et al. (1997) Proc. Natl. Acad. Sci. USA 
94:3554-3559). Recently, it has been shown that DWF4 encodes a cytochrome P450 
whose amino acid sequence is 43% identical to CPD\ DWF4 has been named CYP90B1 
(Choe et al. (1998) Plant Cell 10:231-243). Based on results from feeding studies using 
1 0 BR biosynthetic intermediates, the proposed rate-limiting step of BR biosynthesis, 
22a-hydroxylation, is now known to be blocked in dwf4 mutants. 

In the plant sterol biosynthetic pathway, several of the genes have been cloned or 
identified based on heterologous expression or sequence similarity. First, Corey et al. 
(1993) Proc. Natl Acad. Sci. USA 90:1 1628-1 1632 isolated a cycloartenol synthase 
1 5 cDNA by heterologous complementation of yeast mutants lacking lanosterol synthase. In 
addition, two types of cDNAs encoding sterol methyltransferases have been isolated from 
soybean (Shi et al. (1996) J. Biol. Chem. 271:9384-9389) and Arabidopsis (Husselstein et 
al. (1996) FEBS Lett. 381 : 87-92). The Arabidopsis cDNA has been shown to mediate a 
second methyltransferase step leading to C 29 sterols (Bouvier-Nave et al. (1997) Eur. J. 
20 Biochem. 246:518-529). For the 14oe-demethylation reaction, Bak et al. (1997) Plant J. 

11:191-201 cloned the cDNA encoding the 14-ademethylase cytochrome P450 enzyme 
(CYP51) from Sorghum bicolor. Based on sequence similarity, Grebenok et al. (1997) 
Plant Mol. Biol. 34:891-896 identified an Arabidopsis sterol C-8 isomerase (GenBank 
accession number AF030357). Furthermore, an ERGOSTEROL25 (ERG25) homolog for 
25 Arabidopsis (C-4 demethylase) also has been discovered in the genome sequencing 

project (GenBank accession number AL021635). Finally, a sterol C-7 reductase has been 
cloned by heterologous expression of an Arabidopsis cDNA in yeast (Lecain et al. (1996) 
J. Biol. Chem. 271:10866-10873). 

As compared with the wealth of cloned genes in sterol biosynthesis, only one 
30 mutant has been found in these genes. Gachotte et al. (1995) Plant J. 8:407-416 screened 
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an ethyl methanesulfonate (EMS)-induced mutant population (22,000 M 2 plants) for 
mutants displaying an altered sterol profile. The screen yielded one mutant, steroll 
(stel), whose endogenous level of C-5-desaturated sterols is reduced to 30% of that of the 
wild type. Expression of the yeast gene ERG3 (the gene for A 7 sterol C-5 desaturase) in 
5 the stel-1 mutant increased the level of C-5-desaturated sterols 1 .7- to 2.8-fold compared 

with the stel-1 control, suggesting functional conservation of the enzymes from yeast and 
plants. However, visible phenotypes were not found in stel-1 plants. Thus, the authors 
hypothesized that the residual 30% level of C-5-desaturated sterols was sufficient for the 
growth of plants. 

10 A large collection of BR dwarf mutants have been characterized. Of the eight dwf 

loci identified to date, dwf (cpd\ Szekeres et al. (1996) Cell 85:171-182), dwff (Choe et 
al (1998) Plant Cell 10:231-243), and dwf6 (det2\ Li et al. (1996) Science 272:398-401) 
have been shown to act in the BR biosynthetic pathway, whereas dwf 2 (bril) probably is 
involved in BR perception (Clouse et al. (1996) Plant Physiol. 1 1 1 :671-678; Li and 

15 Chory (1997) Cell 90:929-938). 

Disclosure of the Invention 

The present invention is based on the discovery of various mutants of a BR 
biosynthetic locus, designated dwarf (dwf/). The STE1 locus in dwf/ mutants contain 

20 loss-of-function mutations. Two allelic variants of dwf/ have been characterized, dwp-1 

and dwf -2^ also designated stel-2 and stel-3, respectively. A homologue of the dwf 
mutants, HDF7, is also described herein. Feeding studies with BR biosynthetic 
intermediates and analysis of endogenous levels of BR and sterol biosynthetic 
intermediates indicate that the defective step in the dwf mutants resides before the 

25 production of 24-methylenecholesterol in the sterol biosynthetic pathway. Furthermore, 

results from feeding studies with 13 C-labeled mevalonic acid and compactin show that the 
defective step is specifically the A 7 sterol C-5 desaturation. Sequencing of the STE1 
locus in the two dwf variants shows premature stop codons in the first (dwf -2) and the 
third (dwf-T) exons. Thus, the reduction of BRs in dwf is due to a shortage of substrate 

30 sterols and is the direct cause of the dwarf phenotype in dwf. 
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Accordingly, in one embodiment, the present invention is directed to an isolated 
dwf7 polynucleotide that imparts at least one dwf7 mutant phenotype when expressed in a 
plant. The polynucleotide is selected from the group consisting of (a) a polynucleotide 
comprising the nucleotide sequence depicted at positions 143 to 322 , inclusive, of 
Figures 8A-8D; (b) a polynucleotide comprising the nucleotide sequence depicted at 
positions 143 to 1552, inclusive, of Figures 8A-8D; (c) a polynucleotide comprising a 
nucleotide sequence having at least about 70% identity to the nucleotide sequence of (a) 
or (b); (d) a fragment of (a), (b) or (c) comprising at least about 15 contiguous 
nucleotides; and (e) complements of (a), (b), (c), (d) or (e). 

In other embodiments, the present invention is directed to an isolated dwf7 
polynucleotide that imparts at least one dwf7 mutant phenotype when expressed in a 
plant. The polynucleotide is selected from the group consisting of (a) a polynucleotide 
comprising the nucleotide sequence depicted at positions 1506 to 2720, inclusive, of 
Figures 10A-10F; (b) a polynucleotide comprising a nucleotide sequence having at least 
70% identity to the nucleotide sequence of (b); (c) a fragment of (a) or (b) comprising at 
least 15 contiguous nucleotides; and (d) complements of (a), (b), (c) or (d). 

In additional embodiments, the present invention is directed to recombinant 
vectors comprising the isolated dwf7 polynucleotides described above, and control 
elements that are operably linked to the polynucleotides whereby a coding sequence 
within the polynucleotides can be transcribed and translated in a host cell, and at least one 
of the control elements is heterologous to the coding sequence. Also provided are host 
cells transformed with the recombinant vectors, and methods of producing a DWF7 
polypeptide comprising providing a population of host cells as described above and 
culturing the population of cells under conditions whereby the DWF7 polypeptide 
encoded by the coding sequence present in the recombinant vector is expressed. 

In yet further embodiments, the subject invention is directed to a transgenic plant 
comprising a polynucleotide described above, as well as methods of producing a 
transgenic plant comprising the steps of introducing a polynucleotide into a plant cell to 
produce a transformed plant cell; and producing a transgenic plant from the transformed 
plant cell. 
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In an additional embodiment, the invention is directed to a method for altering the 
sterol composition of a plant relative to the wild-type plant comprising introducing a 
polynucleotide as described above into a plant cell to produce a transformed plant cell 
and producing a transgenic plant from the transformed plant cell, wherein the transgenic 

5 plant has an altered sterol composition relative to the wild-type plant, such as an altered 

cholesterol composition relative to the wild-type plant. 

In still further embodiments, the invention is directed to isolated DWF7 
polypeptides encoded by the polynucleotides as described above. In certain 
embodiments, the polypeptide consists of the amino acid sequence depicted at positions 

10 1-60, inclusive, of Figure 9 or the amino acid sequence depicted at positions 1-230, 

inclusive, of Figure 9. In other embodiments, the polypeptide consists of the amino acid 
sequence depicted at positions 1-279, inclusive, of Figure 11. 

In other embodiments, the subject invention is directed to an isolated control 
element having at least about 70% identity to a control element found within nucleotide 

15 positions 43-142 of Figures 8A-8D, or 1-1505 of Figures 10A-10F, a recombinant vector 

comprising the control element and a polynucleotide comprising a coding sequence 
which is heterologous to the control element, host cells transformed with the recombinant 
vector, and methods of producing a recombinant polypeptide comprising providing a 
population of the host cells and culturing the population of cells under conditions 

20 whereby the recombinant polypeptide encoded by the coding sequence present in the 

recombinant vector is expressed. 

These and other embodiments of the present invention will readily occur to those 
of ordinary skill in the art in view of the disclosure herein. 

25 Brief Description of the Figures 

Figure 1 shows the proposed BL biosynthetic pathway from squalene to BL. 
The BL biosynthetic pathway is divided into the sterol-specific pathway, squalene to 
campesterol, and the BR-specific pathway, campesterol to brassinolide. Common names 
for the compounds are labeled, and proposed enzymes involved in each reaction are 

30 boxed and labeled. Genes identified by mutants are marked. The acronyms for some 
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compounds are in parentheses. In the inset, the carbon atoms of the sterol core rings and 
side chain are numbered. 

Figure 2 is a bar graph of measurements of gynoecia and stamens of wild-type, 
(ecotype Wassilewskija-2 [Ws-2]), dwfl-U and dwf4-3 plants. The dwf7-l plant displays 
5 a concomitant reduction in the length of gynoecia and stamens, whereas dwf4-3 displays a 

greater reduction in stamen length. Each data point represents the average length for five 
flowers. Standard errors are shown at each data point. Solid bars indicate the 
gynoecium and white bars denote the stamen. 

Figure 3 compares the response of light-grown wild-type and dwf7-l hypocotyls 
10 to different concentrations of BL. Black bars indicate results using the Wassilewskija-2 

(Ws-2) wild type and white bars dwft-1 plants. The dwf7-l plant responds to 10" 9 M BL 
and is completely rescued by 10" 8 M BL. Error bars indicate ±SE. 

Figure 4 is a bar graph comparing wild-type and dwf7-l inflorescences treated 
with BR intermediates. The lengths of pedicels treated with water, 6-deoxoCT, 
1 5 22-OHCR, and BL were measured to the nearest millimeter (n > 1 5). The pedicels 

elongated greater than twofold in response to all the BRs tested, suggesting that the 
biosynthetic defect in dwf7-l resides before the production of CR. Error bars indicate 
±SE. 

Figure 5 shows GC-MS analysis of wild-type and dwf7-l seedlings fed with 
20 13 C-MVA in the presence of compactin, an inhibitor of MVA biosynthesis. 

Accumulation of episterol with a simultaneous decrease of downstream intermediates, 
including 24-MC and CR, predicts that the C-5 desaturation step is blocked in dwfi-l 
plants. The units are in micrograms per 5 g fresh weight of tissue. The designation ND 
(not detected) means that the quantity is lower than the detection limit. Ws-2 is the 
25 Wassilewskija-2 wild type. 

Figure 6 is a schematic representation of the STE1 gene. Comparison of cDNA 
and genomic DNA sequences revealed three exons (thick boxes) and two introns 
(horizontal bars). The single open reading frame encodes a protein of 281 amino acids. 
The dwf7-2 (stel-3) mutation is located in the first exon, changing a tryptophan to a stop 
30 codon. The dwf7-l (stel-2) mutation also changes a tryptophan to a stop codon (amino 
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acid position 230). The three white boxes indicate the transmembrane domains, and the 
three histidine boxes are lightly shadowed. The figure is drawn to scale by using the 
GCK software (Textco, Inc., West Lebanon, NH). Bar = 120 bp. 

Figure 7 depicts a multiple sequence alignment of DWF7/STE1 with known 

5 sequences for A 7 sterol C-5 desaturases. The GenBank accession numbers for the 

sequences are M62623 (S. cerevisiae) (SEQ ID NO: ), AB004539 

{Schizosaccharomyces pombe) (SEQ ID NO: ), L40390 (C glabrata) (SEQ ID 

NO: ), and AF105034 (DWF7/STE1, Arabidopsis) (SEQ ID NO: ). The conserved 

transmembrane domains and histidine clusters are boxed and labeled. The positions of 

10 the premature stop codons in dwf7-J and dwfi-2 are indicated with filled circles. 

Histidine residues in each conserved histidine box are identified with filled triangles. A 

consensus sequence (SEQ ID NO: ) is shown in the bottom row of the alignment. 

Capital letters stand for residues conserved among all sequences, whereas lowercase 
letters mean >50% identical. Dashes indicate gaps introduced to maximize alignment. 

15 Multiple sequence alignment was performed using PILEUP in the Genetics Computer 

Group software (Madison, WI) with a gap creation penalty of 4 and a gap extension 
parameter of 1 . The annotation of the aligned sequences was performed using the 
ALSCRIPT software (Barton (1993) Protein Eng. 6:37-40). 

Figures 8A-8D depict the complete gene sequence of dwf7, denoted by a dark 

20 grey bar. The premature stop codons for dwf7-l and dwf7-2 are shown with triangles at 

nucleotide positions 1552 and 322, respectively. The coding sequence and corresponding 
amino acid sequence are represented by a light grey bar. The mRNA sequence is 
represented by a black bar and is shown in three segments. The gene includes two introns 
(positions 369-735 and 1042-1395) and three exons. 

25 Figure 9 shows the amino acid sequence corresponding to the coding sequence 

designated in Figures 8A-8D. The polypeptide sequences corresponding to the dwf7-2 
and dw/7-1 alleles occur at positions 1-60 and 1-230, respectively. 

Figures 10A-10F show the gene sequence of the dwf7 homologue, HDF7. The 
coding sequence and corresponding amino acid sequence are shown in three segments 
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(exons), occurring at positions 1506-1734, 2024-2329 and 2416-2720 of the figure. The 
5 ! UTR is shown at positions 1-1505 and the 3' UTR occurs at positions 2721-2925. 

Figure 1 1 shows the amino acid sequence corresponding to the coding sequence 
designated in Figures 10A-10F, The polypeptide sequence corresponding to the HDF7 
5 dwf7 polypeptide occurs at positions 1-230 of the figure. 

Detailed Description of the Invention 

The practice of the present invention will employ, unless otheiwise indicated, 
conventional methods of protein chemistry, biochemistry, recombinant DNA techniques 
1 0 and pharmacology, within the skill of the art. Such techniques are explained fully in the 
literature. See, e.g., Evans, et al., Handbook of Plant Cell Culture (1983, Macmillan 
Publishing Co.); Binding, Regeneration of Plants, Plant Protoplasts (1985, CRC Press); 
Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); 
Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); 
1 5 Remington 's Pharmaceutical Sciences, 1 8th Edition (Easton, Pennsylvania: Mack 
Publishing Company, 1990). 

All publications, patents and patent applications cited herein, whether supra or 
infra, are hereby incorporated by reference in their entirety. 

It must be noted that, as used in this specification and the appended claims, the 
20 singular forms "a", "an" and "the" include plural referents unless the content clearly 

dictates otherwise. Thus, for example, reference to "a polypeptide" includes a mixture of 
two or more polypeptides, and the like. 

The following amino acid abbreviations are used throughout the text: 
Alanine: Ala (A) Arginine: Arg (R) 

25 Asparagine: Asn (N) Aspartic acid: Asp (D) 

Cysteine: Cys (C) Glutamine: Gin (Q) 

Glutamic acid: Glu (E) Glycine: Gly (G) 
Histidine: His (H) Isoleucine: He (I) 

Leucine: Leu (L) Lysine: Lys (K) 

30 Methionine: Met (M) Phenylalanine: Phe (F) 
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Proline: Pro (P) 
Threonine: Thr (T) 
Tyrosine: Tyr (Y) 



Serine: Ser(S) 
Tryptophan: Trp (W) 
Valine: Val (V) 



5 I Definitions 

In describing the present invention, the following terms will be employed, and are 
intended to be defined as indicated below. 

The terms "nucleic acid molecule" and "polynucleotide" are used interchangeably 
and refer to a polymeric form of nucleotides of any length, either deoxyribonucleotides or 

1 0 ribonucleotides, or analogs thereof. This term refers only to the primary structure of the 
molecule and thus includes double- and single-stranded DNA and RNA. It also includes 
known types of modifications, for example, labels which are known in the art, 
methylation, "caps", substitution of one or more of the naturally occurring nucleotides 
with an analog, internucleotide modifications such as, for example, those with uncharged 

15 linkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, 

etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), those 
containing pendant moieties, such as, for example proteins (including e.g., nucleases, 
toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelates (e.g., metals, radioactive metals, boron, 

20 oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g., 
alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucleotide. 
Polynucleotides may have any three-dimensional structure, and may perform any 
function, known or unknown. Nonlimiting examples of polynucleotides include a gene, a 
gene fragment, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, 

25 ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, 
vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid 
probes, and primers. 

A polynucleotide is typically composed of a specific sequence of four nucleotide 
bases: adenine (A); cytosine (C); guanine (G); and thymine (T) (uracil (U) for thymine 

30 (T) when the polynucleotide is RNA). Thus, the term polynucleotide sequence is the 
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alphabetical representation of a polynucleotide molecule. This alphabetical 
representation can be input into databases in a computer having a central processing unit 
and used for bioinformatics applications such as functional genomics and homology 
searching. 

5 Techniques for determining nucleic acid and amino acid "sequence identity" are 

known in the art. Typically, such techniques include determining the nucleotide sequence 
of the mRNA for a gene and/or determining the amino acid sequence encoded thereby, 
and comparing these sequences to a second nucleotide or amino acid sequence. In 
general, "identity" refers to an exact nucleotide-to-nucleotide or amino acid-to-amino 

10 acid correspondence of two polynucleotides or polypeptide sequences, respectively. Two 
or more sequences (polynucleotide or amino acid) can be compared by determining their 
"percent identity." The percent identity of two sequences, whether nucleic acid or amino 
acid sequences, is the number of exact matches between two aligned sequences divided 
by the length of the shorter sequences and multiplied by 100. An approximate alignment 

1 5 for nucleic acid sequences is provided by the local homology algorithm of Smith and 

Waterman, Advances in Applied Mathematics 2:482-489 (1981). This algorithm can be 
applied to amino acid sequences by using the scoring matrix developed by Dayhoff, Atlas 
of Protein Sequences and Structure . M.O. Dayhoff ed. ? 5 suppl. 3:353-358, National 
Biomedical Research Foundation, Washington, D.C., USA, and normalized by Gribskov, 

20 Nucl Acids Res. 14(6):6745-6763 (1986). An exemplary implementation of this 
algorithm to determine percent identity of a sequence is provided by the Genetics 
Computer Group (Madison, WI) in the "BestFit" utility application. The default 
parameters for this method are described in the Wisconsin Sequence Analysis Package 
Program Manual, Version 8 (1995) (available from Genetics Computer Group, Madison, 

25 WI). A preferred method of establishing percent identity in the context of the present 

invention is to use the MPSRCH package of programs copyrighted by the University of 
Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by 
IntelliGenetics, Inc. (Mountain View, CA). From this suite of packages the Smith- 
Waterman algorithm can be employed where default parameters are used for the scoring 

30 table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of 
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six). From the data generated the "Match" value reflects "sequence identity." Other 
suitable programs for calculating the percent identity or similarity between sequences are 
generally known in the art, for example, another alignment program is BLAST, used with 
default parameters. For example, BLASTN and BLASTP can be used using the 
5 following default parameters: genetic code = standard; filter = none; strand = both; cutoff 

= 60; expect =10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH 
SCORE; Databases - non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank 
CDS translations + Swiss protein 4- Spupdate + PIR. Details of these programs can be 
found at the following internet address: http://www.ncbi.nlm.gov/cgi-bin/BLAST. 

1 0 Alternatively, the degree of sequence similarity between polynucleotides can be 

determined by hybridization of polynucleotides under conditions that form stable 
duplexes between homologous regions, followed by digestion with single-stranded- 
specific nuclease(s), and size determination of the digested fragments,. Two DNA, or two 
polypeptide sequences are "substantially homologous" to each other when the sequences 

15 exhibit at least about 70%-85%, preferably at least about 85%-90% more preferably at 

least about 90%-95%, and most preferably at least about 95%-98% sequence identity 
over a defined length of the molecules, or any percentage between the above-specified 
ranges, as determined using the methods above. As used herein, substantially 
homologous also refers to sequences showing complete identity to the specified DNA or 

20 polypeptide sequence. DNA sequences that are substantially homologous can be 

identified in a Southern hybridization experiment under, for example, stringent 
conditions, as defined for that particular system. Defining appropriate hybridization 
conditions is within the skill of the art. See, e.g., Sambrook et al, supra; DNA Cloning, 
supra; Nucleic Acid Hybridization, supra. 

25 The degree of sequence identity between two nucleic acid molecules affects the 

efficiency and strength of hybridization events between such molecules. A partially 
identical nucleic acid sequence will at least partially inhibit a completely identical 
sequence from hybridizing to a target molecule. Inhibition of hybridization of the 
completely identical sequence can be assessed using hybridization assays that are well 

30 known in the art (e.g., Southern blot, Northern blot, solution hybridization, or the like, see 
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Sambroqk, et al., Molecular Cloning: A Laboratory Manual, Second Edition, (1989) 
Cold Spring Harbor, N.Y.). Such assays can be conducted using varying degrees of 
selectivity, for example, using conditions varying from low to high stringency. If 
conditions of low stringency are employed, the absence of non-specific binding can be 
5 assessed using a secondary probe that lacks even a partial degree of sequence identity (for 

example, a probe having less than about 30% sequence identity with the target molecule), 
such that, in the absence of non-specific binding events, the secondary probe will not 
hybridize to the target. 

When utilizing a hybridization-based detection system, a nucleic acid probe is 
10 chosen that is complementary to a target nucleic acid sequence, and then by selection of 

appropriate conditions the probe and the target sequence "selectively hybridize," or bind, 
to each other to form a hybrid molecule. A nucleic acid molecule that is capable of 
hybridizing selectively to a target sequence under "moderately stringent" typically 
hybridizes under conditions that allow detection of a target nucleic acid sequence of at 
15 least about 10-14 nucleotides in length having at least approximately 70% sequence 

identity with the sequence of the selected nucleic acid probe. Stringent hybridization 
conditions typically allow detection of target nucleic acid sequences of at least about 10- 
14 nucleotides in length having a sequence identity of greater than about 90-95% with the 
sequence of the selected nucleic acid probe. Hybridization conditions useful for 
20 probe/target hybridization where the probe and target have a specific degree of sequence 

identity, can be determined as is known in the art (see, for example, Nucleic Acid 
Hybridization: A Practical Approach, editors B.D. Hames and S J. Higgins, (1985) 
Oxford; Washington, DC; IRL Press). 

With respect to stringency conditions for hybridization, it is well known in the art 
25 that numerous equivalent conditions can be employed to establish a particular stringency 

by varying, for example, the following factors: the length and nature of probe and target 
sequences, base composition of the various sequences, concentrations of salts and other 
hybridization solution components, the presence or absence of blocking agents in the 
hybridization solutions (e.g., formamide, dextran sulfate, and polyethylene glycol), 
30 hybridization reaction temperature and time parameters, as well as, varying wash 
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conditions. The selection of a particular set of hybridization conditions is selected 
following standard methods in the art (see, for example, Sambrook, et al., Molecular 
Cloning: A Laboratory Manual Second Edition, (1989) Cold Spring Harbor, N.Y.). 
A "gene" as used in the context of the present invention is a sequence of 
5 nucleotides in a genetic nucleic acid (chromosome, plasmid, etc.) with which a genetic 
function is associated. A gene is a hereditary unit, for example of an organism, 
comprising a polynucleotide sequence that occupies a specific physical location (a "gene 
locus" or "genetic locus") within the genome of an organism. A gene can encode an 
expressed product, such as a polypeptide or a polynucleotide (e.g., tRNA). Alternatively, 
10 a gene may define a genomic location for a particular event/function, such as the binding 

of proteins and/or nucleic acids, wherein the gene does not encode an expressed product. 
Typically, a gene includes coding sequences, such as, polypeptide encoding sequences, 
and non-coding sequences, such as, promoter sequences, polyadenlyation sequences, 
transcriptional regulatory sequences (e.g., enhancer sequences). Many eucaryotic genes 
15 have "exons" (coding sequences) interrupted by "introns" (non-coding sequences). In 

certain cases, a gene may share sequences with another gene(s) (e.g., overlapping genes). 

A "coding sequence" or a sequence which "encodes" a selected polypeptide, is a 
nucleic acid molecule which is transcribed (in the case of DNA) and translated (in the 
case of mRNA) into a polypeptide, for example, in vivo when placed under the control of 
20 appropriate regulatory sequences (or "control elements"). The boundaries of the coding 

sequence are typically determined by a start codon at the 5' (amino) terminus and a 
translation stop codon at the 3 ! (carboxy) terminus. A coding sequence can include, but is 
not limited to, cDNA from viral, procaryotic or eucaryotic mRNA, genomic DNA 
sequences from viral or procaryotic DNA, and even synthetic DNA sequences. A 
25 transcription termination sequence may be located 3 ? to the coding sequence. Other 

"control elements" may also be associated with a coding sequence. A DNA sequence 
encoding a polypeptide can be optimized for expression in a selected cell by using the 
codons preferred by the selected cell to represent the DNA copy of the desired 
polypeptide coding sequence. "Encoded by" refers to a nucleic acid sequence which 
30 codes for a polypeptide sequence, wherein the polypeptide sequence or a portion thereof 
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contains an amino acid sequence of at least 3 to 5 amino acids, more preferably at least 8 
to 10 amino acids, and even more preferably at least 15 to 20 amino acids from a 
polypeptide encoded by the nucleic acid sequence. Also encompassed are polypeptide 
sequences which are immunologically identifiable with a polypeptide encoded by the 

5 sequence. 

Typical "control elements", include, but are not limited to, transcription 
promoters, transcription enhancer elements, transcription termination signals, 
polyadenylation sequences (located 3' to the translation stop codon), sequences for 
optimization of initiation of translation (located 5' to the coding sequence), translation 

10 enhancing sequences, and translation termination sequences. Transcription promoters 

can include inducible promoters (where expression of a polynucleotide sequence operably 
linked to the promoter is induced by an analyte, cofactor, regulatory protein, etc.), 
repressible promoters (where expression of a polynucleotide sequence operably linked to 
the promoter is induced by an analyte, cofactor, regulatory protein, etc.), and constitutive 

15 promoters. For purposes of the present invention, control elements for the dwf7 gene are 

found in the 5 f and 3 T UTRs shown in Figures 8A-8B, particularly at positions 43-142 and 
1710-1890, respectively, of the figure. Control elements for HDF7 are found within the 
5 f and 3 f UTRs shown in Figures 10A-10F, particularly within the region between 
positions 1-1505 and 2721-2925, respectively. 

20 A control element, such as a promoter, "directs the transcription" of a coding 

sequence in a cell when RNA polymerase will bind the promoter and transcribe the 
coding sequence into mRNA, which is then translated into the polypeptide encoded by 
the coding sequence. 

"Expression enhancing sequences" typically refer to control elements that 

25 improve transcription or translation of a polynucleotide relative to the expression level in 

the absence of such control elements (for example, promoters, promoter enhancers, 
enhancer elements, and translational enhancers (e.g., Shine and Delagarno sequences). 

"Operably linked" refers to a juxtaposition wherein the components so described 
are in a relationship permitting them to function in their intended manner. A control 

30 sequence "operably linked" to a coding sequence is ligated in such a way that expression 
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of the coding sequence is achieved under conditions compatible with the control 
sequences. The control elements need not be contiguous with the coding sequence, so 
long as they function to direct the expression thereof. Thus, for example, intervening 
untranslated yet transcribed sequences can be present between a promoter and the coding 
5 sequence and the promoter can still be considered "operably linked" to the coding 

sequence. 

A "heterologous sequence" as used herein typically refers to a nucleic acid 
sequence that is not normally found in the cell or organism of interest.. For example, a 
DNA sequence encoding a polypeptide can be obtained from a plant cell and introduced 

10 into a bacterial cell. In this case the plant DNA sequence is "heterologous" to the native 

DNA of the bacterial cell 

The "native sequence" or "wild-type sequence" of a gene is the polynucleotide 
sequence that comprises the genetic locus corresponding to the gene, e.g., all regulatory 
and open-reading frame coding sequences required for expression of a completely 

15 functional gene product as they are present in the wild-type genome of an organism. The 

native sequence of a gene can include, for example, transcriptional promoter sequences, 
translation enhancing sequences, introns, exons, and poly-A processing signal sites. It is 
noted that in the general population, wild-type genes may include multiple prevalent 
versions that contain alterations in sequence relative to each other and yet do not cause a 

20 discernible pathological effect. These variations are designated "polymorphisms" or 

"allelic variations." 

"Recombinant" as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with all or a portion of the polynucleotide 

25 with which it is associated in nature; and/or (2) is linked to a polynucleotide other than 

that to which it is linked in nature. The term "recombinant" as used with respect to a 
protein or polypeptide means a polypeptide produced by expression of a recombinant 
polynucleotide. 

By "vector" is meant any genetic element, such as a plasmid, phage, transposon, 
30 cosmid, chromosome, virus etc., which is capable of transferring gene sequences to target 
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cells. Generally, a vector is capable of replication when associated with the proper 
control elements. Thus, the term includes cloning and expression vehicles, as well as 
viral vectors and integrating vectors. 

As used herein, the term "expression cassette" refers to a molecule comprising at 
least one coding sequence operably linked to a control sequence which includes all 
nucleotide sequences required for the transcription of cloned copies of the coding 
sequence and the translation of the mRNAs in an appropriate host cell. Such expression 
cassettes can be used to express eukaryotic genes in a variety of hosts such as bacteria, 
blue-green algae, plant cells, yeast cells, insect cells and animal cells. Under the 
invention, expression cassettes can include, but are not limited to, cloning vectors, 
specifically designed plasmids, viruses or virus particles. The cassettes may further 
include an origin of replication for autonomous replication in host cells, selectable 
markers, various restriction sites, a potential for high copy number and strong promoters. 

A cell has been "transformed" by an exogenous polynucleotide when the 
polynucleotide has been introduced inside the cell. The exogenous polynucleotide may 
or may not be integrated (covalently linked) into chromosomal DNA making up the 
genome of the cell. In procaryotes and yeasts, for example, the exogenous DNA may be 
maintained on an episomal element, such as a plasmid. With respect to eucaryotic cells, a 
stably transformed cell is one in which the exogenous DNA has become integrated into 
the chromosome so that it is inherited by daughter cells through chromosome replication. 
This stability is demonstrated by the ability of the eucaryotic cell to establish cell lines or 
clones comprised of a population of daughter cells containing the exogenous DNA. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and 
other such terms denoting procaryotic microorganisms or eucaryotic cell lines cultured as 
unicellular entities, are used interchangeably, and refer to cells which can be, or have 
been, used as recipients for recombinant vectors or other transfer DNA, and include the 
progeny of the original cell which has been transfected. It is understood that the progeny 
of a single parental cell may not necessarily be completely identical in morphology or in 
genomic or total DNA complement to the original parent, due to accidental or deliberate 
mutation. Progeny of the parental cell which are sufficiently similar to the parent to be 



17 



2225-0003 
PATENT 



characterized by the relevant property, such as the presence of a nucleotide sequence 
encoding a desired peptide, are included in the progeny intended by this definition, and 
are covered by the above terms. 

The term "dwfl polynucleotide" refers to a polynucleotide derived from, or 
5 homologous to, the dwfl gene. The gene encodes the protein variously referred to herein 

as DWF7, STE1 and DWF7/STE1. DWF7 is a A 7 sterol C-5 desaturase that functions in 
the brassinolide (BL) biosynthetic pathway from squalene to BL (see, Figure 1). The 
dwfl polynucleotide sequence and corresponding amino acid sequence are known and 
have been described in, e.g., Gachotte et al. (1996) Plant J. 9:391-398 and GenBank 

10 accession No. AF105034. See, also, Figures 8A-8D depicting the dwfl gene sequence 
and the corresponding DWF7 amino acid sequence. As shown in Figures 8A-8D, the 
dwfl gene spans the region from nucleotide positions 1-1889; the upstream 5' UTR, 
including the promoter region, spans nucleotide positions 1-142; the downstream 3' UTR 
is present from nucleotide position 1710-1889. The term as used herein encompasses a 

15 polynucleotide including a native sequence depicted in Figures 8A-8D, as well as 
modifications and fragments thereof. 

The term encompasses alterations to the polynucleotide sequence, so long as the 
alteration results in a plant displaying one or more dwfl phenotypic traits (described 
below) when the polynucleotide is expressed in a plant. Such modifications typically 

20 include deletions, additions and substitutions, to the native dwfl sequence, so long as the 
mutation results in a plant displaying a dwfl phenotype as defined below. These 
modifications may be deliberate, as through site-directed mutagenesis, or may be 
accidental, such as through mutations of plants which express the dwfl polynucleotide or 
errors due to PCR amplification. The term encompasses expressed allelic variants of the 

25 wild-type dwfl sequence which may occur by normal genetic variation or are produced 

by genetic engineering methods and which result in a detectable change in the wild-type 
dwfl phenotype. Two particular dwfl allelic variants described herein are dwfl-1 and 
dwfl -2. Polypeptides corresponding to these variants include about amino acids 1-60 
and 1-230, respectively, of Figure 9. However, the boundaries of these polypeptides may 

30 vary by 1 to 10 or more amino acids, or any integer therebetween. Thus, dwf7-l and 
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dwf7-2 polypeptides may include, for example, amino acids 1-59 and 1-229, respectively, 
or 3-62 and 3-232, respectively, and so on. Also described herein is a dwf7 
polynucleotide termed "HDF7" The term "dwf7 polynucleotide" as used herein, is 
intended to encompass the HDF7 polynucleotide. This polynucleotide is shown in 
Figures 10A-10F herein. The polypeptide encoded by HDF7 is depicted at about 
positions 1-279 of Figure 11. As with the dwf7-l and dwf7~2 polypeptides, the 
boundaries of the HDF7 polypeptide may also vary by 1 to 10 or more amino acids, or 
any integer therebetween. These molecules are discussed in detail below. 

The term "dwf7 phenotype" as used herein refers to any microscopic or 
macroscopic change in structure or morphology of a plant, such as a transgenic plant, as 
well as biochemical differences, which are characteristic of a dwf7 plaint, compared to a 
progenitor, wild-type plant cultivated under the same conditions. Generally, 
morphological differences include short robust stems, reduced fertility, prolonged life 
cycle, dark-green, round, and curled leaves when grown in the light, In the dark, these 
plants exhibit short hypocotyls and expanded cotyledons, as compared to the wild-type 
plant. The height of such plants will typically be 75% or less of the wild-type plant, more 
typically 50% or less of the wild-type plant, and even more typically 25% or less of the 
wild-type plant, or any integer in between. Additional phenotypic morphological 
attributes of the dwf7 mutant are summarized in Table 1 of the examples. Biochemically, 
dwf7 hypocotyls are converted to wild-type length with the application of BL. 

A "polypeptide" is used in it broadest sense to refer to a compound of two or more 
subunit amino acids, amino acid analogs, or other peptidomimetics. The subunits may be 
linked by peptide bonds or by other bonds, for example ester, ether, etc. As used herein, 
the term "amino acid" refers to either natural and/or unnatural or synthetic amino acids, 
including glycine and both the D or L optical isomers, and amino acid analogs and 
peptidomimetics. A peptide of three or more amino acids is commonly called an 
oligopeptide if the peptide chain is short. If the peptide chain is long, the peptide is 
typically called a polypeptide or a protein. Full-length proteins, analogs, and fragments 
thereof are encompassed by the definition. The terms also include postexpression 
modifications of the polypeptide, for example, glycosylation, acetylation, 
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phosphorylation and the like. Furthermore, as ionizable amino and carboxyl groups are 
present in the molecule, a particular polypeptide may be obtained as an acidic or basic 
salt, or in neutral form. A polypeptide may be obtained directly from the source 
organism, or may be recombinantly or synthetically produced (see further below). 
5 A "DWF7" polypeptide is a polypeptide as defined above, which is derived from 

a A 7 sterol C-5 desaturase that functions in the brassinolide (BL) biosynthetic pathway 
from squalene to BL (see, Figure 1). The native sequence of full-length DWF7 is shown 
in Figure 9. However, the term encompasses analogs and fragments of the native 
sequence so long as the protein functions for its intended purpose. Moreover, the term 
10 "DWF7 polypeptide" is intended to encompass the HDF7 polypeptide and analogs 
thereof. 

The term "DWF7 analog" refers to derivatives of DWF7 and HDF7, or fragments 
of such derivatives, that retain desired function, e.g., as measured in assays as described 
further below. In general, the term "analog" refers to compounds having a native 

1 5 polypeptide sequence and structure with one or more amino acid additions, substitutions 
(generally conservative in nature) and/or deletions, relative to the native molecule, so 
long as the modifications do not destroy desired activity. Preferably, the analog has at 
least the same activity as the native molecule. Methods for making polypeptide analogs 
are known in the art and are described further below. 

20 Particularly preferred analogs include substitutions that are conservative in nature, 

i.e., those substitutions that take place within a family of amino acids that are related in 
their side chains. Specifically, amino acids are generally divided into four families: (1) 
acidic ~ aspartate and glutamate; (2) basic — lysine, arginine, histidine; (3) non-polar ~ 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and 

25 (4) uncharged polar — glycine, asparagine, glutamine, cysteine, serine threonine, tyrosine. 

Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic amino 
acids. For example, it is reasonably predictable that an isolated replacement of leucine 
with isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
similar conservative replacement of an amino acid with a structurally related amino acid, 

30 will not have a major effect on the biological activity. It is to be understood that the 
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terms include the various sequence polymorphisms that exist, wherein amino acid 
substitutions in the protein sequence do not affect the essential functions of the protein. 

By "purified" and "isolated" is meant, when referring to a polypeptide or 
polynucleotide, that the molecule is separate and discrete from the whole organism with 

5 which the molecule is found in nature; or devoid, in whole or part, of sequences normally 
associated with it in nature; or a sequence, as it exists in nature, but haiving heterologous 
sequences (as defined below) in association therewith. It is to be understood that the 
term "isolated" with reference to a polynucleotide intends that the polynucleotide is 
separate and discrete from the chormosome from which the polynucleotide may derive. 

10 The term "purified" as used herein preferably means at least 75% by weight, more 

preferably at least 85% by weight, more preferably still at least 95% by weight, and most 
preferably at least 98% by weight, of biological macromolecules of the same type are 
present. An "isolated polynucleotide which encodes a particular polypeptide" refers to a 
nucleic acid molecule which is substantially free of other nucleic acid molecules that do 

15 not encode the subject polypeptide; however, the molecule may include some additional 

bases or moieties which do not deleteriously affect the basic characteristics of the 
composition. 

By "fragment" is intended a polypeptide or polynucleotide consisting of only a 
part of the intact sequence and structure of the reference polypeptide or polynucleotide, 

20 respectively. The fragment can include a 3' or C-terminal deletion or a 5' or N-terminal 

deletion, or even an internal deletion, of the native molecule. A polynucleotide fragment 
of a dwp sequence will generally include at least about 15 contiguous bases of the 
molecule in question, more preferably 18-25 contiguous bases, even more preferably 30- 
50 or more contiguous bases of the dwp molecule, or any integer between 15 bases and 

25 the full-length sequence of the molecule. Fragments which provide at least one dwf7 

phenotype as defined above are useful in the production of transgenic plants. Fragments 
are also useful as oligonucleotide probes, to find additional dwp sequences. 

Similarly, a polypeptide fragment of a DWF7 molecule will generally include at 
least about 10 contiguous amino acid residues of the full-length molecule, preferably at 

30 least about 15-25 contiguous amino acid residues of the full-length molecule, and most 
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preferably at least about 20-50 or more contiguous amino acid residues of the full-length 
DWF7 molecule, or any integer between 10 amino acids and the full-length sequence of 
the molecule. Such fragments are useful for the production of antibodies and the like. 
By "transgenic plant" is meant a plant into which one or more exogenous 
5 polynucleotides have been introduced. Examples of means by which this can be 

accomplished are described below, and include Agrobacterium-mediated transformation, 
biolistic methods, electroporation, and the like. In the context of the present invention, 
the transgenic plant contains a polynucleotide which is not normally present in the 
corresponding wild-type plant and which confers at least one dw/7 phenotypic trait to the 

10 plant. The transgenic plant therefore exhibits altered structure, morphology or 

biochemistry as compared with a progenitor plant which does not contain the transgene, 
when the transgenic plant and the progenitor plant are cultivated under similar or 
equivalent growth conditions. Such a plant containing the exogenous polynucleotide is 
referred to here as an generation transgenic plant. Transgenic plants may also arise 

15 from sexual cross or by selfing of transgenic plants into which exogenous 

polynucleotides have been introduced. Such a plant containing the exogenous nucleic 
acid is also referred to here as an R x generation transgenic plant. Transgenic plants which 
arise from a sexual cross with another parent line or by selfing are "descendants or the 
progeny" of a R t plant and are generally called F n plants or S n plants, respectively, n 

20 meaning the number of generations. 

II Modes of Carrying Out the Invention 

Before describing the present invention in detail, it is to be understood that this 
invention is not limited to particular formulations or process parameters as such may, of 
25 course, vary. It is also to be understood that the terminology used herein is for the 

purpose of describing particular embodiments of the invention only, and is not intended 
to be limiting. 

Although a number of compositions and methods similar or equivalent to those 
described herein can be used in the practice of the present invention, the preferred 
30 materials and methods are described herein. 
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The present invention is based on the morphological, biochemical, and molecular 
analysis of Arabidopsis dwf7 mutants. Morphologically, dwf7 plants display a dramatic 
reduction in the length of many different organs examined, and this size reduction is 
attributable to a defect in cell elongation. Biochemically, dwf7 hypocotyls are converted 

5 to wild-type length with the application of BL, suggesting a deficiency in BRs. In 

agreement with this, BR intermediate feeding analysis, accompanied by analysis of 
endogenous levels of BRs and sterols by using GC-SIM, indicates that dwf7 is defective 
specifically in the A 7 sterol C-5 desaturase step of the sterol biosynthetic pathway. 
Sequencing of the A 7 sterol C-5 desaturase gene in two allelic variants, dwf7-l and 

10 dwf7-2, revealed premature stop codons, suggesting loss-of-function mutations. Thus, it 

appears that a shortage of sterols leads to a drastic reduction of BR levels in dwf7 mutants 
and to the characteristic dwarf phenotype. 

The molecules of the present invention are therefore useful in the production of 
transgenic plants which display at least one dwf7 phenotype, so that the resulting plants 

1 5 have altered structure or morphology. The present invention particularly provides for 

altered structure or morphology such as reduced cell length, extended flowering periods, 
increased size of leaves or fruit, increased branching, increased seed production and 
altered sterol composition relative wild-type plants. The DWF7 polypeptides can be 
expressed to engineer a plant with desirable properties. The engineering is accomplished 

20 by transforming plants with nucleic acid constructs described herein which may also 

comprise promoters and secretion signal peptides. The transformed plants or their 
progenies are screened for plants that express the desired polypeptide. 

Engineered plants exhibiting the desired altered structure or morphology can be 
used in plant breeding or directly in agricultural production or industrial 

25 applications. Plants having the altered polypeptide can be crossed with other altered 

plants engineered with alterations in other growth modulation enzymes, proteins or 
polypeptides to produce lines with even further enhanced altered structural morphology 
characteristics compared to the parents or progenitor plants. 
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Isolation of Nucleic Acid Sequences from Plants 

The isolation of dw/7 sequences from the polynucleotides of the invention may be 
accomplished by a number of techniques. For instance, oligonucleotide probes based on 
the sequences disclosed here can be used to identify the desired gene in a cDNA or 
genomic DNA library from a desired plant species. To construct genomic libraries, large 
segments of genomic DNA are generated by random fragmentation, e.g. using restriction 
endonucleases, and are ligated with vector DNA to form concatemers that can be 
packaged into the appropriate vector. To prepare a library of tissue-specific cDNAs, 
mRNA is isolated from tissues and a cDNA library which contains the gene transcripts is 
prepared from the mRNA. 

The cDNA or genomic library can then be screened using a probe based upon the 
sequence of a cloned gene such as the polynucleotides disclosed here. Probes may 
be used to hybridize with genomic DNA or cDNA sequences to isolate homologous 
genes in the same or different plant species. Alternatively, the nucleic acids of interest 
can be amplified from nucleic acid samples using amplification techniques. For instance, 
polymerase chain reaction (PCR) technology to amplify the sequences of the genes 
directly from mRNA, from cDNA, from genomic libraries or cDNA libraries. PCR and 
other in vitro amplification methods may also be useful, for example, to clone nucleic 
acid sequences that code for proteins to be expressed, to make nucleic acids to use as 
probes for detecting the presence of the desired mRNA in samples, for nucleic acid 
sequencing, or for other purposes. 

Appropriate primers and probes for identifying rfw/7-specific genes from plant 
tissues are generated from comparisons of the sequences provided herein. For a general 
overview of PCR see Innis et al. eds, PCT Protocols: A Guide to Methods and 
Applications, Academic Press, San Diego (1990). Appropriate primers for this invention 
include, for instance, those primers described in the Examples and Sequence Listings, as 
well as other primers derived from the dwf sequences disclosed herein. Suitable 
amplifications conditions may be readily determined by one of skill in the art in view of 
the teachings herein, for example, including reaction components and amplification 
conditions as follows: 10 mM Tris-HCl, pH 8.3, 50 mM potassium chloride, 1.5 mM 
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magnesium chloride, 0.001% gelatin, 200 \M dATP, 200 \M dCTP, 200 \M dGTP, 200 
\iM dTTP, 0.4 jxM primers, and 100 units per mL Taq polymerase; 96 °C for 3 min., 30 
cycles of 96°C for 45 seconds, 50°C for 60 seconds, 72°C for 60 seconds, followed by 
72°Cfor5min. 

5 Polynucleotides may also be synthesized by well-known techniques as described 

in the technical literature. See, e.g., Carruthers, et al. (1982) Cold Spring Harbor Symp. 
Quant. Biol 47:411-418, and Adams, et al. (1983) J. Am. Chem. Soc. 105:661. Double 
stranded DNA fragments may then be obtained either by synthesizing the complementary 
strand and annealing the strands together under appropriate conditions, or by adding the 

1 0 complementary strand using DNA polymerase with an appropriate primer sequence. 

The polynucleotides of the present invention may also be used to isolate or create 
other mutant cell gene alleles. Mutagenesis consists primarily of site-directed 
mutagenesis followed by phenotypic testing of the altered gene product. Some of the 
more commonly employed site-directed mutagenesis protocols take advantage of vectors 

15 that can provide single stranded as well as double stranded DNA, as needed. Generally, 
the mutagenesis protocol with such vectors is as follows. A mutagenic primer, i.e., a 
primer complementary to the sequence to be changed, but consisting of one or a small 
number of altered, added, or deleted bases, is synthesized. The primer is extended in 
vitro by a DNA polymerase and, after some additional manipulations, the now 

20 double-stranded DNA is transfected into bacterial cells. Next, by a variety of methods, 
the desired mutated DNA is identified, and the desired protein is purified from clones 
containing the mutated sequence. For longer sequences, additional cloning steps are often 
required because long inserts (longer than 2 kilobases) are unstable in those vectors. 
Protocols are known to one skilled in the art and kits for site-directed mutagenesis are 

25 widely available from biotechnology supply companies, for example from Amersham 
Life Science, Inc. (Arlington Heights, 111.) and Stratagene Cloning Systems (La Jolla, 
Calif.). 
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Control elements 

Regulatory regions can be isolated from the dwp gene and used in recombinant 
constructs for modulating the expression of the dwf7 gene or a heterologous gene in vitro 
and/or in vivo. As shown in Figures 8A-8D, the coding region of the dwf7 gene 
5 (designated by the light grey bar) begins at nucleotide position 143. The region of the 
gene spanning nucleotide positions 1-142 of Figures 8A-8D includes the dwf7 promoter. 
This region may be used in its entirety or fragments of the region may be isolated which 
provide the ability to direct expression of a coding sequence linked thereto. 

Thus, promoters can be identified by analyzing the 5' sequences of a genomic 

10 clone corresponding to the rfw/7-specific genes described here. Sequences characteristic 
of promoter sequences can be used to identify the promoter. Sequences controlling 
eukaryotic gene expression have been extensively studied. For instance, promoter 
sequence elements include the TATA box consensus sequence (TATAAT), which is 
usually 20 to 30 base pairs upstream of the transcription start site. In most instances the 

1 5 TATA box is required for accurate transcription initiation. In plants, further upstream 

from the TATA box, at positions -80 to -100, there is typically a promoter element with a 
series of adenines surrounding the trinucleotide G (or T) N G. (See, J. Messing et al., in 
Genetic Engineering in Plants, pp. 221-227 (Kosage, Meredith and Hollaender, eds. 
(1983)). Methods for identifying and characterizing promoter regions in plant genomic 

20 DNA are described, for example, in Jordano et al. (1989) Plant Cell 1 : 855-866; Bustos et 
al. (1989) Plant Cell 1:839-854; Green et al. (1988) EMBOJ. 7:4035-4044; Meier et al. 
(1991) Plant Cell 3:309-316; and Zhang et al. (1996) Plant Physiology 1 10:1069-1079). 

Additionally, the promoter region may include nucleotide substitutions, insertions 
or deletions that do not substantially affect the binding of relevant DNA binding proteins 

25 and hence the promoter function. It may, at times, be desirable to decrease the binding of 
relevant DNA binding proteins to "silence" or "down-regulate" a promoter, or conversely 
to increase the binding of relevant DNA binding proteins to "enhance" or "up-regulate" a 
promoter. In such instances, the nucleotide sequence of the promoter region may be 
modified by, e.g., inserting additional nucleotides, changing the identity of relevant 
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nucleotides, including use of chemically-modified bases, or by deleting one or more 
nucleotides. 

Promoter function can be assayed by methods known in the art, preferably by 

measuring activity of a reporter gene operatively linked to the sequence being tested for 
5 promoter function. Examples of reporter genes include those encoding luciferase, green 

fluorescent protein, GUS, neo, cat and bar. 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon 

junctions are also within the scope of the invention. UTR sequences include introns and 

5' or 3' untranslated regions ( 5' UTRs or 3' UTRs). As shown in Figure 6, the dwf7 
10 gene sequence includes three exons (thick boxes) and two introns (horizontal bars). See, 

also, Figures 8A-8D for the 5' and 3' UTRs. Similarly, the HDF7 gene includes three 

exons (at positions 1506-1734, 2024-2329 and 2416-2720, denoted by the corresponding 

protein sequence indicated) and two introns (between these exons) and 5 r and 3 T UTRs. 

These portions of the dwf7 and HDF7 genes especially UTRs, can have regulatory 
15 functions related to, for example, translation rate and mRNA stability. Thus, these 

portions of the gene can be isolated for use as elements of gene constracts for expression 

of polynucleotides encoding desired polypeptides. 

Introns of genomic DNA segments may also have regulatory junctions. 

Sometimes promoter elements, especially transcription enhancer or suppressor elements, 
20 are found within introns. Also, elements related to stability of heteronuclear RNA and 

efficiency of transport to the cytoplasm for translation can be found in intron elements. 

Thus, these segments can also find use as elements of expression vectors intended for use 

to transform plants. 

The introns, UTR sequences and intron/exon junctions can vary from the native 
25 sequence. Such changes from those sequences preferably will not affect the regulatory 

activity of the UTRs or intron or intron/exon junction sequences on expression, 
transcription, or translation. However, in some instances, down-regulation of such 
activity may be desired to modulate traits or phenotypic or in vitro activity. 
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Use of Nucleic Acids of the Invention to Inhibit Gene Expression 

The isolated sequences prepared as described herein, can be used to prepare 
expression cassettes useful in a number of techniques. For example, expression cassettes 
of the invention can be used to suppress endogenous dwf7 gene expression. Inhibiting 
5 expression can be useful, for instance, in suppressing the phenotype (e.g., dwarf 

appearance, the A 7 sterol C-5 desaturase activity) exhibited by dwf7 plants. 

A number of methods can be used to inhibit gene expression in plants. For 
instance, antisense technology can be conveniently used. To accomplish this, a nucleic 
acid segment from the desired gene is cloned and operably linked to a promoter such that 

10 the antisense strand of RNA will be transcribed. The expression cassette is then 

transformed into plants and the antisense strand of RNA is produced. In plant cells, it has 
been suggested that antisense RNA inhibits gene expression by preventing the 
accumulation of mRNA which encodes the enzyme of interest, see, e.g., Sheehy et al. 
(1988) Proc. Nat Acad. Set USA 85:8805-8809, and Hiatt et al., U.S. Patent Number 

15 4,801,340. 

The nucleic acid segment to be introduced generally will be stLbstantially identical 
to at least a portion of the endogenous gene or genes to be repressed. The sequence, 
however, need not be perfectly identical to inhibit expression. The vectors of the present 
invention can be designed such that the inhibitory effect applies to other proteins within a 

20 family of genes exhibiting homology or substantial homology to the target gene. 

For antisense suppression, the introduced sequence also need not be full length 
relative to either the primary transcription product or fully processed mRNA. Generally, 
higher homology can be used to compensate for the use of a shorter sequence. 
Furthermore, the introduced sequence need not have the same intron or exon pattern, and 

25 homology of non-coding segments may be equally effective. Normally, a sequence of 

between about 30 or 40 nucleotides and about full length nucleotides should be used, 
though a sequence of at least about 100 nucleotides is preferred, a sequence of at least 
about 200 nucleotides is more preferred, and a sequence of at least about 500 nucleotides 
is especially preferred. It is to be understood that any integer between the above-recited 

30 ranges is intended to be captured herein. 
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Catalytic RNA molecules or ribozymes can also be used to inhibit expression of 
dwf7 genes. It is possible to design ribozymes that specifically pair with virtually any 
target RNA and cleave the phosphodiester backbone at a specific location, thereby 
functionally inactivating the target RNA. In carrying out this cleavage, the ribozyme is 
5 not itself altered, and is thus capable of recycling and cleaving other molecules, making it 

a true enzyme. The inclusion of ribozyme sequences within antisense RNAs confers 
RNA-cleaving activity upon them, thereby increasing the activity of the constructs. 

A number of classes of ribozymes have been identified. One class of ribozymes is 
derived from a number of small circular RNAs which are capable of self-cleavage and 

10 replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper 

virus (satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the 
satellite RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco 
mottle virus, solanum nodiflorum mottle virus and subterranean clover mottle virus. The 
design and use of target RNA-specific ribozymes is described in Haseloff et al. (1988) 

15 Nature 334:585-591. 

Another method of suppression is sense suppression. Introduction of expression 
cassettes in which a nucleic acid is configured in the sense orientation with respect to the 
promoter has been shown to be an effective means by which to block the transcription of 
target genes. For an example of the use of this method to modulate expression of 

20 endogenous genes see, Napoli et al. (1990) The Plant Cell 2:279-289 and U.S. Patent 

Numbers 5,034,323, 5,231,020, and 5,283,184. 

Generally, where inhibition of expression is desired, some transcription of the 
introduced sequence occurs. The effect may occur where the introduced sequence 
contains no coding sequence per se, but only intron or untranslated sequences 

25 homologous to sequences present in the primary transcript of the endogenous sequence. 

The introduced sequence generally will be substantially identical to the endogenous 
sequence intended to be repressed. This minimal identity will typically be greater than 
about 65%, but a higher identity might exert a more effective repression of expression of 
the endogenous sequences. Substantially greater identity of more than about 80% is 

30 preferred, though about 95% to absolute identity would be most preferred. It is to be 
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understood that any integer between the above-recited ranges is intended to be captured 
herein. As with antisense regulation, the effect should apply to any other proteins within 
a similar family of genes exhibiting homology or substantial homology. 

For sense suppression, the introduced sequence in the expression cassette, needing 

5 less than absolute identity, also need not be full length, relative to either the primary 

transcription product or fully processed mRNA. This may be preferred to avoid 
concurrent production of some plants which are overexpressers. A higher identity in a 
shorter than full length sequence compensates for a longer, less identical sequence. 
Furthermore, the introduced sequence need not have the same intron or exon pattern, and 

10 identity of non-coding segments will be equally effective. Normally, a sequence of the 

size ranges noted above for antisense regulation is used. 

Use of Nucleic Acids of the Invention to Enhance Gene Expression 

In addition to inhibiting certain features of a plant, the polynucleotides of the 
1 5 invention can be used to increase certain features such as extending flowering, producing 

larger leaves or fruit, producing increased branching and increasing seed production. 

This can be accomplished by the overexpression of dw/7 polynucleotides. 

The exogenous dwf7 polynucleotides do not have to code for exact copies of the 

endogenous DWF7 and HDF7 proteins. Modified DWF7 and HDF7 protein chains can 
20 also be readily designed utilizing various recombinant DNA techniques well known to 

those skilled in the art and described for instance, in Sambrook et al., supra. 

Hydroxylamine can also be used to introduce single base mutations into the coding region 

of the gene (Sikorski et al. (1991) Meth. EnzymoL 194: 302-318). For example, the 

chains can vary from the naturally occurring sequence at the primary structure level by 
25 amino acid substitutions, additions, deletions, and the like. These modifications can be 

used in a number of combinations to produce the final modified protein chain. 
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Preparation of Recombinant Vectors 

To use isolated sequences in the above techniques, recombinant DNA vectors 
suitable for transformation of plant cells are prepared. Techniques for transforming a 
wide variety of higher plant species are well known and described further below as well 

5 as in the technical and scientific literature. See, for example, Weising et al. (1988) Ann. 

Rev. Genet. 22:421-477. A DNA sequence coding for the desired polypeptide, for 
example a cDNA sequence encoding the full length DWF7 protein, will preferably be 
combined with transcriptional and translational initiation regulatory sequences which will 
direct the transcription of the sequence from the gene in the intended tissues of the 

10 transgenic plant. 

Such regulatory elements include but are not limited to the promoters derived 
from the genome of plant cells (e.g., heat shock promoters such as soybean hspl7,5-E or 
hspl7.3-B (Gurley et al. (1986) Mol Cell Biol. 6:559-565); the promoter for the small 
subunit of RUBISCO (Coruzzi et al. (1984) EMBOJ. 3:1671-1680; Broglie et al. (1984) 

15 Science 224:838-843); the promoter for the chlorophyll a/b binding protein) or from plant 
viruses viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson 
et al. (1984) Nature 310:51 1-514), or the coat protein promoter of TMV (Takamatsu et al. 
(1987) EMBO J. 6:307-31 1), cytomegalovirus hCMV immediate early gene, the early or 
late promoters of SV40 adenovirus, the lac system, the trp system, the TAC system, the 

20 TRC system, the major operator and promoter regions of phage A, the control regions of 
fd coat protein, the promoter for 3-phosphoglycerate kinase, the promoters of acid 
phosphatase, heat shock promoters {e.g., as described above) and the promoters of the 
yeast alpha-mating factors. 

In construction of recombinant expression cassettes of the invention, a plant 

25 promoter fragment may be employed which will direct expression of the gene in all 
tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" 
promoters and are active under most environmental conditions and states of development 
or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic 
virus (CaMV) 35S transcription initiation region, the T-DNA mannopine synthetase 

30 promoter (e.g., the 1- or 2'- promoter derived from T-DNA of Agrobacterium 



31 



2225-0003 
PATENT 



tumafaciens), and other transcription initiation regions from various plant genes known to 
those of skill. 

Alternatively, the plant promoter may direct expression of the polynucleotide of 
the invention in a specific tissue (tissue-specific promoters) or may be otherwise under 

5 more precise environmental control (inducible promoters). Examples of tissue-specific 
promoters under developmental control include promoters that initiate transcription only 
in certain tissues, such as fruit, seeds, or flowers such as tissue- or developmental-specific 
promoter, such as, but not limited to the cell promoter, the CHS promoter, the PATATIN 
promoter, etc. The tissue specific E8 promoter from tomato is particularly useful for 

10 directing gene expression so that a desired gene product is located in fruits. 

Other suitable promoters include those from genes encoding embryonic storage 
proteins. Examples of environmental conditions that may affect transcription by 
inducible promoters include anaerobic conditions, elevated temperature, or the presence 
of light. If proper polypeptide expression is desired, a polyadenylation region at the 

1 5 3 T -end of the coding region should be included. The polyadenylation region can be 

derived from the natural gene, from a variety of other plant genes, or from T-DNA. In 
addition, the promoter itself can be derived from the dwf7 or HDF7 genes, as described 
above. 

The vector comprising the sequences (e.g., promoters or coding regions) from 
20 genes of the invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. For example, the marker may encode biocide resistance, 
particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, 
hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta. 

25 Production of Transgenic Plants 

DNA constructs of the invention may be introduced into the genome of the 
desired plant host by a variety of conventional techniques. For reviews of such 
techniques see, for example, Weissbach & Weissbach Methods for Plant Molecular 
Biology (1988, Academic Press, N.Y.) Section VIII, pp. 421-463; and Grierson & Corey, 

30 Plant Molecular Biology (1988, 2d Ed.), Blackie, London, Ch. 7-9. For example, the 
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DNA construct may be introduced directly into the genomic DNA of the plant cell using 
techniques such as electroporation and microinjection of plant cell protoplasts, or the 
DNA constructs can be introduced directly to plant tissue using biolistic methods, such as 
DNA particle bombardment (see, e.g., Klein et al. (1987) Nature 327:70-73). 
5 Alternatively, the DNA constructs may be combined with suitable T-DNA flanking 
regions and introduced into a conventional Agrobacterium tumefaciens host vector. 
Agrobacterium tumefaciens-mediated transformation techniques, including disarming 
and use of binary vectors, are well described in the scientific literature. See, for example 
Horsch et al (1984) Science 233:496-498, and Fraley et al. (1983) Proa Natl Acad. Set 

10 USA 80:4803. The virulence functions of the Agrobacterium tumefaciens host will direct 

the insertion of the construct and adjacent marker into the plant cell DNA when the cell is 
infected by the bacteria using binary T DNA vector (Bevan (1984) Nuc. Acid Res. 
12:871 1-8721) or the co-cultivation procedure (Horsch et al. (1985) Science 
227:1229-1231). Generally, the Agrobacterium transformation system is used to engineer 

15 dicotyledonous plants (Bevan et al. (1982) Ann, Rev. Genet 16:357-384; Rogers et al. 

(1986) Methods EnzymoL 1 1 8:627-641). The Agrobacterium transformation system may 
also be used to transform, as well as transfer, DNA to monocotyledonous plants and plant 
cells, (see Hernalsteen et al. (1984) EMBO 73:3039-3041; Hooykass-Van Slogteren et al. 
(1984) Nature 31 1:763-764; Grimsley et al. (1987) Nature 325:1677-179; Boulton et al. 

20 (1989) Plant Mol Biol 12:31-40.; and Gould et al. (1991) Plant Physiol 95:426-434). 

Alternative gene transfer and transformation methods include, but are not limited 
to, protoplast transformation through calcium-, polyethylene glycol (PEG)- or 
electroporation-mediated uptake of naked DNA (see Paszkowski et al. (1984) EMBO J 
3:2717-2722, Potrykus et al. (1985) Molec. Gen. Genet 199:169-177; Fromm et al 

25 (1985) Proc. Nat. Acad. Set USA 82:5824-5828; and Shimamoto (1989) Nature 

338:274-276) and electroporation of plant tissues (D'Halluin et al. (1992) Plant Cell 
4: 1495-1505). Additional methods for plant cell transformation include microinjection, 
silicon carbide mediated DNA uptake (Kaeppler et al. (1990) Plant Cell Reporter 
9:415-418), and microprojectile bombardment (see Klein et al. (1988) Proc. Nat Acad. 

30 Sci. USA 85:4305-4309; and Gordon-Kamm et al. (1990) Plant Cell 2:603-618). 
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Transformed plant cells which are produced by any of the above transformation 
techniques can be cultured to regenerate a whole plant which possesses the transformed 
genotype and thus the desired phenotype. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically 
5 relying on a biocide and/or herbicide marker which has been introduced together with the 

desired nucleotide sequences. Plant regeneration from cultured protoplasts is described in 
Evans, et al, "Protoplasts Isolation and Culture" in Handbook of Plant Cell Culture, pp. 
124-176 ? Macmillian Publishing Company, New York, 1983; and Binding, Regeneration 
of Plants, Plant Protoplasts, pp. 21-73, CRC Press, Boca Raton, 1985, Regeneration can 

10 also be obtained from plant callus, explants, organs, pollens, embryos or parts thereof. 

Such regeneration techniques are described generally in Klee et al. (1987) Ann. Rev. of 
Plant Phys. 38:467-486. 

The nucleic acids of the invention can be used to confer desired traits on 
essentially any plant. A wide variety of plants and plant cell systems may be engineered 

15 for the desired physiological and agronomic characteristics described herein using the 

nucleic acid constructs of the present invention and the various transformation methods 
mentioned above. In preferred embodiments, target plants and plant cells for engineering 
include, but are not limited to, those monocotyledonous and dicotyledonous plants, such 
as crops including grain crops (e.g., wheat, maize, rice, millet, barley), fruit crops (e.g., 

20 tomato, apple, pear, strawberry, orange), forage crops (e.g., alfalfa), root vegetable crops 

(e.g., carrot, potato, sugar beets, yam), leafy vegetable crops (e.g., lettuce, spinach); 
flowering plants (e.g., petunia, rose, chrysanthemum), conifers and pine trees (e.g., pine 
fir, spruce); plants used in phytoremediation (e.g., heavy metal accumulating plants); oil 
crops (e.g., sunflower, rape seed) and plants used for experimental purposes (e.g., 

25 Arabidopsis). Thus, the invention has use over a broad range of plants, including, but not 

limited to, species from the genera Asparagus, Avena, Brassica, Citrus, Citrullus, 
Capsicum, Cucurbita, Daucus, Glycine, Hordeum, Lactuca, Lycopersicon, Malus, 
Manihot, Nicotiana, Oryza, Persea, Pisum, Pyrus, Primus, Raphanus, Secale, Solanum, 
Sorghum, Triticum, Vitis, Vigna, and Zea. 
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One of skill in the art will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced into 
other plants by sexual crossing. Any of a number of standard breeding techniques can be 
used, depending upon the species to be crossed. 

5 A transformed plant cell, callus, tissue or plant may be identified and isolated by 

selecting or screening the engineered plant material for traits encoded by the marker 
genes present on the transforming DNA. For instance, selection may be performed by 
growing the engineered plant material on media containing an inhibitory amount of the 
antibiotic or herbicide to which the transforming gene construct confers resistance. 

1 0 Further, transformed plants and plant cells may also be identified by screening for the 

activities of any visible marker genes (e.g., the p -glucuronidase, luciferase, B or CI 
genes) that may be present on the recombinant nucleic acid constructs of the present 
invention. Such selection and screening methodologies are well known to those skilled in 
the art. 

1 5 Physical and biochemical methods also may be used to identify plant or plant cell 

transformants containing the gene constructs of the present invention. These methods 
include but are not limited to: 1) Southern analysis or PCR amplification for detecting 
and determining the structure of the recombinant DNA insert; 2) Northern blot, SI RNase 
protection, primer-extension or reverse transcriptase-PCR amplification for detecting and 

20 examining RNA transcripts of the gene constructs; 3) enzymatic assays for detecting 

enzyme or ribozyme activity, where such gene products are encoded by the gene 
construct; 4) protein gel electrophoresis, Western blot techniques, immunoprecipitation, 
or enzyme-linked immunoassays, where the gene construct products are proteins. 
Additional techniques, such as in situ hybridization, enzyme staining, and 

25 immunostaining, also may be used to detect the presence or expression of the 

recombinant construct in specific plant organs and tissues. The methods for doing all 
these assays are well known to those skilled in the art. 

Effects of gene manipulation using the methods of this invention can be observed 
by, for example, northern blots of the RNA (e.g., mRNA) isolated from the tissues of 

30 interest. Typically, if the amount of mRNA has increased, it can be assumed that the 
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endogenous dwf7 gene is being expressed at a greater rate than before. Other methods of 
measuring DWF7 activity can be used. For example, cell length can be measured at 
specific times. Because dwf7 affects the BR biosynthetic pathway, an assay that 
measures the amount of BL can also be used. Such assays are known in the art. Different 

5 types of enzymatic assays can be used, depending on the substrate used and the method of 

detecting the increase or decrease of a reaction product or by-product. In addition, the 
levels of DWF7 protein expressed can be measured immunochemical^, i.e., ELISA, 
RIA, EIA and other antibody based assays well known to those of skill in the art, by 
electrophoretic detection assays (either with staining or western blotting), and sterol (BL) 

10 detection assays. 

The transgene may be selectively expressed in some tissues of the plant or at some 
developmental stages, or the transgene may be expressed in substantially all plant tissues, 
substantially along its entire life cycle. However, any combinatorial expression mode is • 
also applicable. 

15 The present invention also encompasses seeds of the transgenic plants described 

above wherein the seed has the transgene or gene construct. The present invention further 
encompasses the progeny, clones, cell lines or cells of the transgenic plants described 
above wherein said progeny, clone, cell line or cell has the transgene or gene construct. 

20 Polypeptides 

The present invention also includes DWF7 polypeptides, including such 
polypeptides as a fusion, or chimeric protein product (comprising the protein, fragment, 
analog, mutant or derivative joined via a peptide bond to a heterologous protein sequence 
(of a different protein)). Such a chimeric product can be made by ligating the appropriate 

25 nucleic acid sequences encoding the desired amino acid sequences to each other by 
methods known in the art, in the proper coding frame, and expressing the chimeric 
product by methods commonly known in the art. 

In addition, DWF7 polypeptides, derivatives (including fragments and chimeric 
proteins), mutants and analogues can be chemically synthesized. See, e.g., Clark-Lewis 

30 et al. (1991) Biochem. 30:3128-3135 and Merrifield (1963) J. Amer. Chem. Soc. 
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85:2149-2156. For example, DWF7, derivatives, mutants and analogs can be synthesized 
by solid phase techniques, cleaved from the resin, and purified by preparative high 
performance liquid chromatography (e.g., see Creighton, 1983, Proteins, Structures and 
Molecular Principles, W. H. Freeman and Co., N.Y., pp. 50-60). DWF7, derivatives and 
5 analog that are proteins can also be synthesized by use of a peptide synthesizer. The 

composition of the synthetic peptides may be confirmed by amino acid analysis or 
sequencing (e.g., the Edman degradation procedure; see Creighton, 1983, Proteins, 
Structures and Molecular Principles, W. H. Freeman and Co., N.Y., pp. 34-49). 

10 Applications 

The present invention finds use in various applications, for example, including but 
not limited to those listed above. 

The polynucleotide sequences may additionally be used to isolate mutant dwf7 
gene alleles. Such mutant alleles may be isolated from plant species either known or 

15 proposed to have a genotype which contributes to altered plant morphology. 

Additionally, such plant dwf7 gene sequences can be used to detect plant dwf7 gene 
regulatory (e.g., promoter or promotor/enhancer) defects which can affect plant growth. 

The molecules of the present invention can be used to provide plants with 
increased seed and/fruit production, extended flowering periods and increased branching. 

20 The molecules described herein can be used to alter the sterol composition of a plant, 
thereby increasing or reducing cholesterol content in the plant. A still further utility of 
the molecules of the present invention is to provide a tool for studying the biosynthesis of 
brassinosteriods, both in vitro and in vivo. 

The dwf7 gene of the invention also has utility as a transgene encoding a the A 7 

25 sterol C-5 desaturation protein that mediates one or more steps in brassinosteriod 

biosynthesis which results in a transgenic plant to alter plant structure or morphology. 
The dwfl gene also has utility for encoding the DWF7 protein in recombinant vectors 
which may be inserted into host cells to express the DWF7 protein. Further, the dwf7 
polynucleotides of the invention may be utilized (1) as nucleic acid probes to screen 

30 nucleic acid libraries to identify other enzymatic genes or mutants; (2) as nucleic acid 
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sequences to be mutated or modified to produce DWF7 protein variants or derivatives; 
(3) as nucleic acids encoding the A 7 sterol C-5 desaturases in molecular biology 
techniques or industrial applications commonly known to those skilled in the art. 

The dwf7 nucleic acid molecules may be used to design antisense molecules, 

5 useful, for example, in gene regulation or as antisense primers in amplification reactions 
of dwf7 gene nucleic acid sequences. With respect to dwf7 gene regulation, such 
techniques can be used to regulate, for example, plant growth, development or gene 
expression. Further, such sequences may be used as part of ribozyme and/or triple helix 
sequences, also useful for dwf7 gene regulation. 

10 The dwf7 control element (e.g., promoter) of the present invention may be utilized 

as a plant promoter to express any protein, polypeptide or peptide of interest in a 
transgenic plant. In particular, the dw/7 promoter may be used to express a protein 
involved in brassinosteriod biosynthesis. 

The Arabidopsis DWF7 protein of the invention can be used in any biochemical 

15 applications (experimental or industrial) where A 7 sterol C-5 desaturation activity is 

desired, for example, but not limited to, regulation of BL synthesis, regulation of other 
sterol synthesis, modification of elongating plant structures, and experimental or 
industrial biochemical applications known to those skilled in the art. 

20 III. Experimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
25 amounts, temperatures, etc.), but some experimental error and deviation should, of 

course, be allowed for. 

Restriction and modifying enzymes, as well as PCR reagents were purchased from 
commercial sources, and used according to the manufacturers 1 directions. In the cloning 
of DNA fragments, except where noted, all DNA manipulations were done according to 
30 standard procedures. See, e.g., Sambrook et al., supra. Restriction enzymes, T^ DNA 
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ligase, E. coli, DNA polymerase I, Klenow fragment, and other biological reagents were 
purchased from commercial suppliers and used according to the manufacturers 1 
directions. 

5 Materials and Methods 

A. Plant Growth 

For sterile growth of Arabidopsis thaliana plants, seeds of mutants and the wild 
type were sterilized (50% Clorox and 0.005% Triton X-100) for 8 min, washed three 
times with sterile distilled water, and dried with 95% ethanol. The seeds were sprinkled 

10 on 0.8% agar-solidified media or in liquid media containing 1 x Murashige and Skoog 

(Murashige and Skoog (1962) Physiol. Plant. 15:473-497) salts and 0,5% sucrose (pH 5.8 
with KOH). For the plants grown in the dark, the seeds on the plates were illuminated for 
3 hr (240 ^imol m" 2 sec" 1 ) before being wrapped with two or three layers of aluminum foil. 
For the mature plants used for morphometric analysis and gas chromatography-selective 

15 ion monitoring (GC-SIM) studies, seeds were planted on soil (Metromix 350; Grace 

Sierra Co., Milpitas, CA) presoaked with distilled water. The flats containing the pots 
were covered with plastic wrap and cold-treated at 4°C for 2 days before transfer to a 
growth chamber (16 hr of light [240 jamol m" 2 sec" 1 ] and 8 hr of dark at 22 and 21 °C, 
respectively, and 75 to 90% humidity). The plastic wrap was removed after 2 to 3 days. 

20 The pots were subirrigated in distilled water or Hoagland's nutrient solution as required. 

B. Morphometric and Physiological Analysis 

At 5 weeks of age, the various morphological traits listed in Table 1 (below) were 
measured. The number of seeds per silique was determined after the plants were 

25 completely dried. Unopened siliques from each plant were selected and crushed, and the 

number of seeds was counted under a dissecting microscope. To measure the fresh and 
dry weight, the aerial parts of the plants were cut and immediately weighed to obtain the 
fresh weight; the plants were then completely dried in a 60° C oven for 5 days before 
measuring the dry weight. Flowers were harvested immediately after petal opening. 

30 Observations on the structure of flowers were made with flowers at stage 14 (Smyth et al. 
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(1990) Plant Cell 2:755-767), which are right beneath the cluster of developing flowers at 
the shoot apices. Individual organs of a flower were separated under the dissecting 
microscope. The length of the organs was measured to a tenth of a millimeter, and the 
four longest stamens for each flower were measured and the mean value calculated. 

The anatomical studies using a scanning electronic microscope and a light 
microscope were performed as described by Azpiroz et al. (1998) Plant Cell 10:219-230. 

C. Mapping and Sequencing of the DWARF7 Locus 

The mapping of dwfl was performed using simple sequence length polymorphism 
(SSLP) markers (Bell and Ecker (1994) Genomics 19:137-144). Briefly, dwp-1 mutants 
(Wassilewskija-2 [Ws-2] background) were crossed to Columbia wild-type plants. 
Genomic DNA was isolated (Dellaporta et al. 1983) from individual F 2 dwarf plants. To 
locate the mutation to one of the five chromosomes, 20 individual plants were tested with 
at least two SSLP markers per chromosome. The polymerase chain reaction (PCR) 
amplified products were analyzed on 4% agarose gels in 1 x TAE buffer (40 mM 
Tris-acetate and 10 mM EDTA). Once the dw/7-1 mutation was shown to be linked to 
the ngal62 marker located on chromosome 3 (recombination ratio 1 1.9%), we tested 
marker ngal 72, which maps at 2.2 centimorgans. No recombination was detected 
between the dw/7-1 mutation and ngal 72 when 86 chromosomes were tested, suggesting 
that dwp-1 is linked closely to the ngal 72 marker. Linkage between the markers and the 
dwarf phenotype was determined according to Koornneef and Stam (1992) Genetic 
analysis. In Methods in Arabidopsis Research, C. Koncz, N.-H. Chua, and J. Schell, eds 
(Singapore: World Scientific Publishing Co.), pp. 83-99. 

PCR products amplified using primer sets derived from the cDNA sequence of 
STEROL1 (STE1) were subjected to sequencing. To design sets of primers that do not 
fall in exon-intron junctions, we predicted possible splice sites by using the RNASPL 
program available at the internet site of Baylor College of Medicine (Houston, TX; 
http://dot.imgen.bcm.tmc. edu:9331/seq-search/gene-search,html). Primers were designed 
using the Primer Selection software of DNAstar (DNASTAR Inc., Madison, WI). 
Oligonucleotide sequences 5' to 3' are CAGTGTGAGTAAT T TAGCAT TACTA 
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(S5D_FF), GGAAAGATCATC-AAACAT T TACATGT (S5D_LR), GCGCAATCT 
TCT T TCGT T T (S5DJF), TGGACAACAACAACACAAGA (S5D_1R), 
GATGCACAGAGAGCT- TCATGAC (S5D_2F), CCGGCAAATGGAGAGAGTGTAT 
(S5D_2R), CACCCATCATATCTACAACAA (S5D_3F), and CATCT T T 

5 TGCCG-GCGAATCTAT (S5D_4F) (underlines were added to distinguish forward or 

reverse primers from the gene acronym S5D). Primers were purchased from Genosys 
Biotechnologies, Inc. (The Woodlands, TX). For template DNA, genomic DNA was 
isolated from two or three leaves of dwf7-l and wild-type plants according to the method 
described by Krysan et al. (1996) Proc. Natl. Acad. Sci. USA 93:8145-8150. 

1 0 Amplification of the DNA fragment spanning the whole coding region was performed 

with the S5D_4F and S5D_1R primer set with Taq polymerase (Boehringer Mannheim). 

Standard PCR reaction mixtures, 1 x PCR buffer (10 mM Tris-HCl, 1.5 mM 
MgCl 2 , and 50 mM KC1, pH 8.3), 0.2 (iM each of forward and reverse primer, 0.2 mM 
each deoxynucleotide triphosphates, 1 ng of genomic DNA, and 2 units of Taq 

1 5 polymerase were subjected to a PCR program consisting of an initial denaturation at 

95°C for 2 min and then for 35 cycles (95°C for 30 sec, 56°C for 30 sec, and 72°C for 
2.5 min ), with a final elongation step of 7 min at 72°C. PCR-amplified DNA was 
size-separated on 0.8% agarose gels in 1 x TAE, and the resulting DNA bands were 
gel-purified using a DNA purification kit (Bio-Rad). The concentration of the extracted 

20 DNA was measured by comparing the band intensity with a DNA mass standard 

(Bethesda Research Laboratories). Sequencing of the DNA was performed at the Arizona 
Research Laboratory (University of Arizona, Tucson). DNA sequence analysis was 
conducted using software packages, including one from Genetics Computer Group 
(Madison, WI) and other database search tools available on the Internet. 

25 The base change in dwf7-l eliminated the recognition site for a restriction enzyme 

Haelll by converting the sequence from GGCC to AGCC. Thus, we utilized this 
polymorphism to test the cosegregation of the dwarf phenotype with the mutation. The 
0.8 kb of DNA spanning the mutation was amplified using S5D3F and S5D__1R primers 
from 17 different dwarf plants from the mapping lines. Two microliters from each 20 \iL 

30 of PCR-amplified DNA was digested with the restriction enzyme Haelll (Boehringer 
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Mannheim). After complete digestion, the samples were resolved on a 2% agarose gel in 
lxTAE buffer. 

Genomic DNA sequence flanking the cDNA was identified by sequencing the 
products obtained from thermal asymmetric interlaced PCR (TAIL PGR) (Liu et al. 
5 (1995) Plant J. 8:457-463). Two sets of primers were used to amplify the 5' and 3' 
flanking DNA. Oligonucleotide sequences 5' to 3' are 

GTAGAAGCACCAGAGGAAACCGGAGATGAAGT (D7-5-1; melting temperature of 
69°C) ? AAGTATAGTAGGGT TCCGGCGAGG-TA (D7-5-2; melting temperature of 
64 °C), ATAGAT TCGCCG-GCAAAAGATGACTC (D7-5-3; melting temperature of 

10 63°C), TGC-AGGATACCATACGATACACCACACGACAT (D7-3-1; melting 

temperature of 68°C), CATACGATACACCACACGACATACAAGCAT-AACTA 
(D7-3-2; melting temperature of 67°C), and ATATGGATG-GAT TGGATGT T 
TGGCTCTC (D7-3-3; melting temperature of 63 °C). The melting temperature of each 
primer was calculated with the formula 69.3 + 0.41 (%GC) - 650/L (Mazars et al. (1991) 

15 Nucleic Acids Res. 19:4783), where L is length of primer. Arbitrary degenerate primers 
AD1, AD2, and AD3 were synthesized according to the sequence described by Liu et al. 
(1995) Plant J. 8:457-463. TAIL PCR was performed according to the program 
originally described by Liu et al. 1995. TAIL PCR-amplified DNA was separated on 1% 
agarose gels and gel extracted for sequencing. 

20 

D. Feeding Experiments 

Biochemical complementation of dwf7-l plants with different concentrations of 
brassinolide (BL) was performed in liquid media. BL-supplemented (control, 10" 9 , 10~\ 
and 10" 7 M) sterile liquid media (1.5 mL) was dispensed into wells of a 24-well plate 
25 (Corning Co., Corning, NY). Three seedlings, germinated on agar-solidified media, were 
transferred into each well. After a week of growth with continuous shaking (230 rpm), 
the seedlings were lightly stained with toluidine blue, and hypocotyls and roots were 
measured to the nearest millimeter. 

Feeding experiments using biosynthetic intermediates were performed with 
30 3-week-old mutant plants. The intermediates tested were diluted to the desired 
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concentration with water containing 0.01% Tween 20. Two microliters of each 
brassinosteroid (BR) solution was applied daily to the shoot tips of plants by using a 
micro pipettman. After 1 week of treatment, total growth of inflorescence and pedicels 
was measured to the nearest millimeter (n = 15). 

E. Analysis of Endogenous BRs 

Plants were grown for 5 weeks on soil. Two hundred grams of the aerial parts of 
plants, including stems, flowers, leaves, and siliques, was harvested and subjected to BR 
extraction. The procedure for extraction and analysis of BR intermediates by using 
GC-SIM has been described (Fujioka et al. (1997) Plant Cell 9:1951-1 962). 

F. 13 C-Labeled Mevalonic Acid Feeding Experiments 

Before feeding experiments, seedlings were germinated and grown on 0.5 x 
Murashige and Skoog (Murashige and Skoog (1962) Physiol Plant. 15:473-497) agar 
medium in the light at 22 °C (25 mL per dish). Eight days after sowing, the seedlings 
were transferred to a 200-mL flask containing 30 mL of Murashige and Skoog 
(Murashige and Skoog (1962) Physiol. Plant. 15:473-497) media supplemented with 3% 
sucrose (Ws-2, five seedlings; dwf7-l, 40 seedlings). 

Compactin (mevastatin; Sigma) was converted to its sodium salt as described 
previously (Kita et al (1980) J. Clin. Invest. 66:1094-1100). DL-Mevalonolactone-2- 13 C 
( 13 C-MVA; Isotec, Miamisburg, OH) was dissolved in methanol. Solutions of compactin 
and 13 C-MVA were added aseptically to each 200-mL flask (final concentration, 10 \iM 
compactin and 4.5 mM 13 C-MVA) just after the seedlings were transferred, and seedlings 
were allowed to grow for 1 1 days at 22 °C in the light on a shaker (110 rpm). After 
incubation, the seedlings (~5 g fresh weight of both Ws-2 and dwf7-l plant materials) 
were extracted with methanol (250 mL), and the extract was partitioned between CHC1 3 
and H 2 0. The CHCl 3 -soluble fraction was purified with a silica cartridge column 
(Sep-Pak Vac 12 cc; Waters, Milford, MA), which was eluted with 20 mL of CHC1 3 . The 
eluate was purified with an octadecylsilane (ODS) cartridge column (Sep-Pak PLUS CI 8; 
Waters), which was eluted with 20 mL of methanol. The fraction was subjected to HPLC 
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on an ODS column as follows: column, Senshu Pak ODS 4150-N (150 x 10 mm); 

solvent, methanol; flow rate, 2 mL /min; and detection, UV 205 nm. Fractions were 

collected every 0.5 min (between retention times of 10 to 20 min). Main fractions of each 

sterol were as follows: 5-dehydroepisterol (retention time of 1 1.5 to 12 min), episterol 
5 (retention time of 12.5 to 13 min), 24-methylenecholesterol (24-MC; retention time of 13 

to 13.5 min), 7-dehydrocampestanol (retention time of 14.5 to 15 min), and campesterol 

(CR; retention time of 15.5 to 16 min). 

Each fraction was converted to a trimethylsilyl derivative and analyzed by gas 

chromatography-mass spectrometry (GC-MS). GC-MS analyses were performed on a 
10 JEOL Automass JMS-AM 150 mass spectrometer (Tokyo, Japan) connected to a 

Hewlett-Packard 5890A-II gas chromatograph with a capillary column DB-5 (0.25 mm x 

15 m; 0.25-^im film thickness). The analytical conditions were the same as previously 

described (Fujioka et al. 1997). 

5-Dehydroepisterol, episterol, and 7-dehydrocampestanol were chemically 
15 synthesized. 

Example 1 
Isolation of dwf7 Mutants 
The dwp-l mutant originally was identified in a screen of 14,000 
20 T-DNA-transformed lines of Arabidopsis. Genetic complementation tests with other dwf 

loci indicated that dwf7 belongs to a unique complementation group. dwf7-l segregated 
as a monogenic recessive mutation; progeny from a heterozygote segregated 325 (wild- 
type) :98 (dwf7-l). Although dwf7-l originated from a T-DNA mutant population, it 
failed to cosegregate with the kanamycin resistance marker in the T-DNA, suggesting 
25 that dwp-1 was an untagged mutant. Furthermore, mapping the dwf/-l mutation to the 

Arabidopsis genome by using simple sequence length polymorphisms (SSLPs; Bell and 
Ecker (1994) Genomics 19:137-144) confirmed that dw/7 maps to a location different 
from previously isolated dwarfs. The meiotic recombination ratio between dwf7 and the 
SSLP marker ngal 72 on chromosome 3 was scored as 0 / 86, indicating tight linkage of 
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dwf7 to ngal 72. According to a recent recombinant inbred map of Arabidopsis, ngal 72 
is located 2.2 centimorgans from the top of chromosome 3. 

A second allele of dwf7 was identified among 43 dwarf mutants isolated by 
screening >5 0 3 000 M2 seeds of an EMS mutant population. Similar to dw/7-1, the new 
5 allele was biochemically complemented by early BR biosynthetic intermediates, 

including 22 a-hydroxycampesterol (22-OHCR) and cathasterone, and mapped near 
ngal 72. Sequencing revealed a premature stop codon in exon 1 (see below). 



Example 2 

10 Morphological Analysis of dw/7-1 

dwf7 displays many of the characteristics of other BR dwarfs. The characteristic 
dwarf phenotype, such as short robust stems, reduced fertility, and dark-green, round, and 
curled leaves are found in the plants. Compared with 1 -month-old wild-type plants, 
dw/7-1 plants grown for 5 weeks in the light possess short robust inflorescences, 

15 dark-green, round leaves, reduced fertility, and short pedicels and siliques. The wild-type 

generally terminates flowering before 7 weeks of age; however, dwf7-l continues to 
produce flowers at this age. At 7 weeks of age, wild-type plants had ceased growing, 
whereas dw/7-1 plants continued to grow, indicating a prolonged life span. 

Additional morphological defects of 5-week-old light-grown plants are 

20 summarized in Table 1. Most noticeably, the height of dwf7-l plants is strikingly 

reduced and is only 14% that of wild-type height. The leaf blade width of dwf7-l 
mutants is similar to that of wild-type plants; however, the length is greatly reduced (1.8 
cm) as compared with that of the wild type (3 cm), resulting in the round shape of dw/7-1 
leaves. The overall morphology of dwf7-2 was similar to dw/7-1 except that it was 

25 slightly shorter and more sterile. 
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Table 1. Morphometric Analysis of Wild-Type and dw/7-iPlants at 5 Weeks of Age 


Measurement (n = 15) 


Wild Type 


dwf-1 


Inflorescence 
Height (cm) 

Number of inflorescences 


31.6 ±0.9 
3.9 ± 0.6 


4.5 ± 0.4 
4.3 ± 0.5 


Reproductive organs 
Number of reproductive organs 
Length of siliques (mm) 
Number of seeds a 


130.2 ± 12.9 
14.8 ± 1.2 
49.7 ±5.1 


89.3 ± 20.9 
3.9 ±0.8 

12.4 ±2.4 


Leaf 

Number of resette leaves 
Leaf blade width (cm) b 
Leaf blade length (cm) b 


9.1 ± 1.2 
1.4 ±0.1 
3.0 ±0.3 


10.3 ± 1.9 
1.4 ±0.3 
1.8 ±0.3 


Weight 

Fresh weight (g) 
Dry weight (mg) 
Fresh weight/dry weight 


1.50 ±0.19 
215 ±29 
7.0 ± 0.3 


0.51 ±0.10 
53 ± 11 
9.7 ± 0.6 



a The number of sees per silique was determined after plant senescence. 
b The second pair of rosette leaves. 

25 Because null mutations in the BR pathway result in a dwarf phenotype, as well as 

defects in skotomorphogenesis, we compared the dwfl-1 mutant with other BR dwarfs for 
growth in the dark. Hypocotyl lengths from the longest to the shortest were 18+1.6 
(wild-type; units in millimeters + SE; n = 15), 6.3 ± 0.29 (dwf7-l), 4.1 ± 0.03 
(det2/dwf6), 1.26 ± 0.09 {dwf4\ 1.24 ± 0.08 {cpd/dwf3\ and 1.18 ± 0.08 (bril/dwf2). 

30 These data indicate that dwj7-l displays a less severe phenotype (35% that of wild-type 

hypocotyl length) than do other BR dwarfs (e.g., 7% of wild type in dwf4; Choe et al. 
(1998) Plant Cell 10:231-243). Furthermore, dwf7-l frequently displayed closed 
cotyledons and hooks similar to those of the wild type, whereas severe dwarfs, including 
bril/dw/2, cpd/dwf3 y and dwf4 9 showed expanded cotyledons and open hooks. 



46 



2225-0003 
PATENT 



Unlike severe dwarfs, such as dwf4 and cpd, dwf7-l mutants are not mechanically 
sterile. However, the average number of seeds in a silique is reduced in dw/7-1 (n = 12) 
compared with that of the wild-type for reasons yet to be identified (n = 49) (Table 1). 
Scanning electron microscopy demonstrated a relationship between fertility and floral 

5 structure. In the wild type, the length of stamens was greater than or similar to that of the 

gynoecium (quantified in Figure 2), facilitating dehiscence of pollen on the stigmatic 
surface. The fertile dwft-1 flower had a concomitant reduction in the size of the 
gynoecium and the stamen. Although dwf7-l flowers (Figure 2) possess stamens and 
gynoecia that are shorter than those in the wild type, the fertility of dw?f7-l flowers is 

10 possible through the concomitant reduction in the length of both organs. In contrast, only 

stamen elongation was affected more severely in dwf4-3 flowers (Figure 2). Because 
sterile dw/4-3 flowers have shorter filaments than the gynoecium, pollen dehiscence on 
the stigmatic surface is prevented. The short stamen length in dwf4 is likely to cause 
dehiscence of pollen on the ovary wall rather than on the stigmatic surface. In fact, when 

15 dw/4 pollen is transferred to either wild-type or dwf7-l stigmas, viable seeds are made. 

The common denominator for the various phenotypes found in dwf7-l mutants is 
a reduction in longitudinal growth, which could be due to either a reduced number of 
cells or a failure in cell elongation. Observations made with other BEL dwarf mutants 
suggest that the number of cells is comparable in the wild type and mutants (Kauschmann 

20 et al. (1996) Plant J, 9:701-713; Nomura et al. (1997) Plant Physiol. 113:31-37; Azpiroz 

et al. (1998) Plant Cell 10:219-230). The length of cells in the epidermis, cortex, and 
xylem of dwf7-l was greatly reduced (<30% of wild type). This reduced cell size was 
converted to the length of the wild type in response to daily application of 10" 7 M BL for 
1 week. Thus, the reduced organ length in dwf7-l also is due to a failure of cell 

25 elongation. 

The organization of vascular bundles in wild-type and dwf7-l mutants was also 
examined. Wild-type inflorescences possessed eight vascular bundles. However, the 
number of vascular bundles was reduced to six in dwf7-L Furthermore, the spacing 
between the vascular bundles in dw/7-1 was irregular. In the wild type, interfascicular 

30 parenchyma cells alternated regularly with vascular bundles; however, cross-sections of 
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dwft-1 showed that two vascular bundles were joined without being separated by 
parenchyma cells. Within a single vascular bundle, the size and number of xylem cells in 
dwf7-l plants generally were reduced, whereas the number of phloem cells was similar to 
or even greater than that in the wild-type. This characteristic abnormality of vascular 
5 bundle organization has been observed consistently in other BR dwarfs (Szekeres et al. 
(1996) Cell 85:171-182). 

Example 3 

Biochemical Complementation of dwf7-l with BL 

10 Figure 3 demonstrates that dwf7-l seedlings grown in BL-supplemented liquid 

media were remarkably sensitive to BL. Growth in 1 nM BL induced significant 
elongation of dwf7-l hypocotyls (160% increase), whereas the wild-type increase was 
marginal (5%). Treatment with 10 and 100 nM BL completely rescued dw/7-1 
hypocotyls to wild-type length. The strongest response of the wild type to BL was 

15 obtained at 100 nM (Figure 3). Higher concentrations of BL (1 \xM) caused a stressed 

morphology, including inhibition of root growth and swollen, twisted, and fragile 
hypocotyls in both dwp-1 and wild-type plants. After BL treatment of dwf7-l, cells in 
the treated region of the stem were similar in length to wild-type cells. 

The overall morphology of plants is dependent on three factors: cell size, shape, 

20 and number (Cosgrove (1997) Plant Cell 9:1031-1041). Various signals modulate these 

factors. Environmental signals, such as water, temperature, and light, are transduced to 
invoke internal hormone signals, including auxins, gibberellins, and BRs. These signals 
then trigger the cell elongation process, including but not limited to cell wall loosening 
by xyloglucan endotransglycosylases and expansins. Thus, a block in any of the signal 

25 transduction cascades from the environmental signals to the cell elongation process could 

result in dwarfism. Mutants resistant to or deficient in classic hormones, such as auxin 
(e.g., auxin resistant! [axr2]; Timpte (1992) Genetics 138:1239-1249) and gibberellin 
([gal to ga5 andgaz]; Koornneef and van der Veen (1980) Theor. Appl. Genet. 
58:257-263; Koornneef et al. (1985) Physiol. Plant. 65:33-39), often result in dwarfism. 

30 Thus, we first tested whether dw/7 is either rescued by or resistant to exogenous 
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application of these hormones. Three- week-old dwf7-l plants sprayed with 0.1 mM GA 3 
responded, as did the wild-type (<10% increase of inflorescence height); however, GA 3 
did not rescue the dwpA phenotype. In addition, dwf7-l roots grown on indole acetic 
acid-supplemented agar media (0.1 \iM) displayed stunted morphology similar to that of 
5 the wild-type, suggesting that dwf7-l is not resistant to the exogenous application of 

auxin. The reduction of hypocotyl length in dwf7-l was rescued by the application of BL 
(Figure 3). Both wild-type and dwf7-l plants responded to BL, but dwf7-l plants were 
hypersensitive. The length of dwf7-l hypocotyls was increased 160% in response to 1 
nM BL as compared with the untreated control, whereas the wild-type responded 

10 marginally (5%). In addition, application of BRs to 3-week-old dwf7-l plants induced 

the growth of many different organs, including stems, leaves, siliques, petioles, and 
pedicels, suggesting that the major defect in dwp-1 is a deficiency of BL. 

Apart from a reduction in cell elongation, a deficiency of endogenous BRs 
resulted in altered organization of vascular tissue in the inflorescence. Szekeres et al. 

15 (1996) Cell 85:171-182 showed that the number of xylem cells in cpd was decreased as 

compared with the wild-type, whereas the number of phloem cells was increased. The 
authors reasoned that this could be due to unequal division of cambial cells. 
Furthermore, previous studies on the effects of BRs on vascular development indicated 
that BRs play a role in tracheary element formation (Clouse and Zurek (1991) Molecular 

20 analysis of brassinolide action in plant growth and development. In Brassinosteroids: 

Chemistry, Bioactivity and Applications, H.G. Cutler, T. Yokota, and G. Adam, eds 
(Washington DC: American Chemical Society), pp. 122-140; Iwasaki and Shibaoka 
(1991) Plant Cell. Physiol. 32:1007-101). Because BRs also have been found in the 
cambial region of pine, indicative of an important role in this tissue (Kim et al. (1990) 

25 Plant Physiol. 94:1709-1713), we hypothesize that the deficiency of BRs in dwarf 

mutants caused changes in cell fate in vascular cambial cells through yet unknown 
mechanisms. 

Auxins also are known to be a major factor affecting differentiation of the 
vascular system (Aloni (1987) Annu. Rev. Plant Physiol. 38:179-204). Lincoln et al 
30 (1990) Plant Cell 2:1071-1080 showed that stem cross-sections of axrl displayed altered 
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development of the vascular system. The vascular bundles in axrl mutants are located 
peripherally and are not as regularly spaced as compared with those in wild-type plants 
(Lincoln et al (1990) Plant Cell 2:1071-1080). Furthermore, as opposed to the reduced 
number of vascular bundles in dwp-1 (five to seven), axrl plants possess a greater 
5 number of bundles (eight to nine) as compared with the wild type (six to eight). Thus, it 
seems that auxins and BRs play opposing roles in determining the number of vascular 
bundles. Two other assays in which auxin and BR interactions have been demonstrated 
are the rice lamina bending assay and hypocotyl hook opening bioassay. Results from 
these assays include the fact that the degree of effect caused by the combined application 

10 of auxin and BR was greater than was the sum of the effect of each, indicative of a 

synergistic effect of the two hormones (Yopp et al. (1981) Physiol. Plant. 53:445-452; 
Takeno and Pharis (1982) Plant Cell Physiol. 23:1275-1281 reviewed in Mandava (1988) 
Annu. Rev. Plant Physiol. Plant Mol. Biol. 39:23-52). However, the details of the 
mechanisms for interactive and independent action remain to be elucidated. 

15 It needs to be pointed out that hypocotyl growth in darkness is accomplished 

through both GA- and BR-dependent cell elongation processes. One piece of evidence 
for dependence on both GA and BR is that dwfl-1 hypocotyls elongated fivefold in 
response to darkness as compared with light-grown hypocotyls, although they are still 
shorter than those of the wild-type. Because BL levels are not detectable in dw/7-1 plants 

20 (Table 2), growth of dwp-1 in the dark could be accomplished mostly by GA-dependent 
cell elongation processes. Peng and Harberd (1997) Plant Physiol. 113:1051-1058 and 
Azpiroz et al (1998) Plant Cell 10:219-230 found that both gai and dwf4, respectively, 
partially suppressed the stem elongation phenotype of a light receptor mutant, hy, 
suggesting that hypocotyl elongation in the absence of light inhibition requires 

25 independent growth contributed by both GA and BRs. 
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Example 4 

Identification of the BR Biosvnthetic Defect in dwf/-l 
Biochemical complementation of dwf7-l following application of BL suggested 
that dwf7-l is likely to be defective in BR biosynthesis. To pinpoint the defective step in 

5 the BR biosynthetic pathway, dwfl-1 mutants were treated with BR biosynthetic 

intermediates. Due to undetectable bioactivity of some early intermediates (CR to 
6-oxocampestanol) in bioassays (Fujioka et al. 1995; Choe et al. (1998) Plant Cell 
10:231-243), these were not used. Instead, three biologically active compounds were 
chosen, 22-OHCR, 6-deoxoCT, and BL, for these feeding tests (see Figure 1). Because 

10 the 22a-hydroxylation reaction is reported to be mediated by DWF4 (Choe et al. (1998) 
Plant Cell 10:231-243), biochemical complementation of dwf mutants other than dwf4 by 
22-OHCR places the defective step upstream of CR. 

Complementing compounds induced growth of internodes and strongly increased 
pedicel length. The dw/7-1 pedicels treated with 22-OHCR and BL showed growth 

1 5 greater than or equal to that of the wild-type. Measurements of pedicel length shown in 

Figure 4 demonstrated that the three compounds tested, 22-OHCR, 6-deoxoCT, and BL, 
all increased dwp-1 pedicel length >200% as compared with the control, suggesting that 
the defective step in BR biosynthesis is located at or before the CR biosynthetic step. 
Similarly, 3-week-old inflorescences of dwf7-2 were tested with 22-OHCR, 6-deoxoCT, 

20 teasterone, and BL. All four compounds induced significant elongation of pedicels and 
internodes, indicating that dwf7-l and dwfl-2 share the same biosynthetic defect. 

As shown in Table 2, more definitive results indicating a specific defect in BR 
biosynthesis have been obtained from gas chromatography-selective ion monitoring 
(GC-SIM) analysis of endogenous BRs and sterols in dwf7-l plants. The endogenous 

25 levels of sterols, such as 24-MC, CR, and campestanol (CN), in wild-type plants, were 
3800, 32,900, and 1140 ng/g fresh weight, respectively. However, the levels of all three 
sterols in dwf7-l mutants were extremely diminished at 3.1, 1.1, and 1.4% of the wild- 
type, respectively, suggesting that the biosynthetic block is located before 24-MC. These 
data are consistent with the results of intermediate feeding studies (Figure 4). 

30 



51 



22,25-0003 
PATENT 



Table 2. Quantification of Endogenous BRs from Wild Type and dwf7-l by 
Using GC-SIM 


BRs 


Wild Tvne a 


dw/7-1 


34-MC 


3,800 


118 


CR 


32,900 


379 


CN 


1,140 


16 


fi -T) eoY ot e a eron e 


0.05 


NA b 


6-Deoxotyphasterol 


2.3 


NA 


6-Deoxocastasterone 


4.0 


ND° 


Typhasterol 


0.27 


ND 


CS 


0.28 


0.13 


BL 


0.2 


ND 



1 5 a The unit of measurement is nanograms per gram fresh weight. 
b NA, not analyzed. 

C ND, not detected. The endogenous amount of the BR is less than the detection limit 
(-0.05 ng/g fresh weight). 

20 

Further biochemical feeding studies with 13 C-labeled mevalonic acid (MVA) and 
compactin, a MVA biosynthetic inhibitor, were performed to identify the specific sterol 
biosynthetic step defective in dwf7-l plants. In a preliminary experiment, the effects of 
compactin and MVA on the growth of Arabidopsis seedlings in liquid media were 

25 investigated. The growth of wild-type Arabidopsis seedlings was almost completely 

inhibited in the presence of 10 \iM compactin. The inhibition, however, was restored to 
the level of controls by the simultaneous application of 4.5 mM of MVA. Therefore, 4.5 
mM 13 C-MVA and 10 jxM compactin were added to Arabidopsis seedling cultures in the 
metabolic feeding studies. After 1 1 days in culture, sterols were extracted and purified by 

30 silica and octadecylsilane (ODS) cartridge columns and ODS-HPLC. Purified samples 

were derivatized and analyzed by gas chromatography-mass spectrometry (GC-MS). As 
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shown in Figure 5, 13 C-MVA was converted to 13 C 5 -episterol and subsequent sterols, such 
as 13 C 5 -24-MC and 13 C 5 -CR in the wild-type. However, the 13 C 5 -5-dehydroepisterol and 
downstream compounds were not detected in dwfl-l mutants, whereas the precursor 
l3 C 5 -episterol accumulated fourfold as compared with the wild- type. In addition, an 
5 uncommon sterol, 13 C 5 -7-dehydrocampestanol (24-epifungisterol), greatly accumulated 
(Figure 5). Two lines of evidence-a failure to convert episterol to subsequent sterols, 
such as 24-MC and CR, and accumulation of 7-dehydrocampestanol in dw/7-7-suggest 
that the defective step in dwf7-l is the C-5 desaturation stop. 

A defect either in a biosynthetic enzyme or a factor modulating an enzymatic 

10 activity could lead to deficiency of endogenous BRs. To place dwf7 ait a specific step in 

the proposed BR biosynthetic pathway, we first chose to perform feeding studies with BR 
biosynthetic intermediates. Rescue of dwft-1 by exogenous application of 22-OHCR 
suggests that the biosynthetic defect likely resides before the production of CR. 
Consistent with the results from feeding studies, the endogenous levels of 24-MC, CR, 

1 5 and CN were extremely reduced in dwf7-l (Table 2). These data indicate that the 

biosynthetic defect is before 24-MC; dw/7-1 contains only 3% of 24-MC as compared 
with the wild type. When the phenotypes of dwf7-l are compared with the downstream 
biosynthetic mutant dwf4 and the BR-insensitive bril (dwfl) mutant (Clouse et al (1996) 
Plant Physiol. 1 1 1 : 67 1-678), it is obvious that dwf7-l displays a weaker phenotype 

20 despite being a presumptive null mutation. This suggests that there could be an 

alternative sterol and BR biosynthetic pathway or that there are duplicate genes at 
individual steps. Providing evidence for the duplicate gene hypothesis, we recently 
cloned a homolog of the DWF7/STE1 gene (named HOMOLOG OF DWF7, HDF7), 
shown in Figures 10 and 1 1 (GenBank Accession No. AAF32466). HDF7 is 80% 

25 identical in amino acid sequence with STE1 . Similarly, Fujioka et al. 1997 reported that 

the endogenous level of CN in det2, which is defective in a step between CR and CN, is 
-10% that of the wild-type amount. The authors hypothesized that the 10% leakage 
through the defective step in det2 mutants, even in a null allele, could be associated with 
a second copy of DET2 that lightly hybridizes in DNA gel blot analyses. 
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Placing dwf7 at a single sterol biosynthetic step was accomplished through 
feeding studies with 13 C-MVA and compactin. A greater than fourfold accumulation of 
episterol accompanying the absence of downstream intermediates in dwf7-l indicates that 
the A 7 sterol C-5 desaturase step is blocked in dwf7. In addition, the feeding studies 
identified an accumulation of 7-dehydrocampestanol, which is an uncommon sterol in 
plants (Figure 5). Accumulation of this compound only in dwfl-1 suggests that sterol 
biosynthesis in dwft-1 could proceed to a C-24 reduction step, skipping C-5 desaturation 
as well as the next immediate C-7 reduction. The C-24 reductase seems to convert 
episterol independently of the immediate upstream enzyme. The absence of a detectable 
amount of C-7-reduced compounds in dwf7-l suggests that the enzymatic step is highly 
dependent on the C-5 desaturation reaction. This confirms the sequence of reactions 
originally proposed by Taton and Rahier (1991) Biochem. Biophys. Res. Commun. 
181:465-473, Taton and Rahier (1996) Arch. Biochem. Biophys. 325:279-288. 

Example 5 
Molecular Characterization oidwfl 

An EMS-induced mutant (stel-1) of STE1 encoding a A 7 sterol C-5 desaturase did 
not possess a dwarf phenotype (Gachotte et al. (1995) Plant J. 8:407-416). However, 
because it is likely that stel-1 is a leaky allele, it was hypothesized that dwf7-l might be a 
strong or null allele. The genomic DNA of the STE1 gene was sequenced and two introns 
and three exons identified by comparing them with the published STE1 cDNA sequence. 
The organization of the STE1 gene is represented schematically in Figure 6. Sequencing 
the STE1 locus in the dwf7 alleles revealed mutations. The mutations found in dwf7-l and 
dwf7-2 were located in the third and the first exons, respectively. Both of the dwf7 alleles 
contained a base change from a guanine to an adenine, converting tryptophan (TGG) to a 
stop codon (TAG in dwf7-l and TGA in dwf-2\ 

In addition to creating a stop codon, the mutation in dwf7-l eliminated a Haelll 
restriction enzyme recognition site (GGCC to AGCC). Taking advantage of this 
restriction enzyme site change, we tested the linkage of this mutation to the dwf7-l 
phenotype. DNAs isolated from 17 different dwarf plants from a segregating F 2 
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population were subjected to polymerase chain reaction (PCR) analysis by using S5D_3F 
and S5D_1R primers (underlines were used to distinguish forward or reverse primers 
from the gene acronym S5D), and the PCR products were digested with Haelll. Agarose 
gel electrophoresis definitively showed that none of the PCR products from 17 mutant 
5 templates was restricted, whereas products from wild-type templates were all restricted at 
the Haelll site. These data suggest that the creation of the premature stop codon in exon 
3 is the cause of the dw/7-1 -conferred phenotype. 

To better understand the importance of these nonsense mutations, we analyzed the 
sequence of STE1 in relation to other C-5 desaturase proteins isolated from fungi. The 

10 STE1 protein is composed of 281 predicted amino acids with a theoretical pi of 6.39 and 
molecular mass of 33 kD. Whereas yeast ERG3 (38% identical; Arthington et al. (1991) 
Gene 107:173-174; GenBank accession number M62623) is predicted to contain four 
transmembrane domains, STE1 possesses three putative transmembrane domains. The 
overall amino acid sequence identities of STE1 with C-5 desaturases from fission yeast 

15 (GenBank accession number AB004539) and Candida glabrata (Geber et al. (1995) 

Antimicrob. Agents Chemother. 39:2708-2717; GenBank accession number L40390) 
were 37 and 33%, respectively (gap creation weight of 4; gap extension weight of 1). In 
addition, multiple sequence alignment of STE1 with the three yeast sequences, shown in 
Figure 7 5 revealed that the transmembrane domains and histidine clusters, which were 

20 first reported by Gachotte et al. (1996) Plant J. 9:391-398, are well conserved between the 
proteins. The three characteristic histidine boxes flank the last transmembrane domain. 
The nonsense mutations are located in the first exon (dwf7-2) and the third exon, 
immediately before the third histidine box {dwp-l\ indicating that at least one histidine 
domain is deleted in each of the dwf7 mutants as a result of the premature stop codons. 

25 The A 7 sterol C-5 desaturase-mediated reaction is common to both photosynthetic 

and nonphotosynthetic organisms. Many genes encoding a C-5 desaturase have been 
cloned from fungi. First, Arthington et al. (1991) Gene 107:173-174 cloned the ERG3 
gene from Saccharomyces cerevisiae. The authors found that viable erg3 mutants, which 
normally accumulate A 7 sterols, were restored to wild-type phenotype when transformed 

30 with a wild-type genomic clone of the A 7 sterol C-5 desaturase gene. Taguchi et al. 
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(1994) Microbiology 140:353-359 showed that the yeast mutant syrl displays dual 
phenotypes, resistance to the phytotoxin syringomycin and susceptibility to higher 
concentrations of Ca 2+ , presumably due to altered membranes. Sequencing the ERG3 
locus in the syrl mutant revealed that syrl is an allele of ERG3. Furthermore, Geber et 
5 al. (1995) Antimicrob. Agents Chemother. 39:2708-2717 cloned both ERG3 and ERG11 
(14a-sterol-demethylase) from C. glabrata. The authors found that lethal erg 11 
mutations can be suppressed by an additional mutation in erg3. They reasoned that 
formation of toxic 3P,6a-diol sterols in erg 11 mutants is prevented due to the defect in 
C-5 desaturation in ergll erg3 double mutants. 

10 In plants, Gachotte et al. (1995) Plant J. 8:407-416 found that the Arabidopsis 

stel-1 mutant, which is deficient in C-5 desaturated sterols, can be partially 
complemented by the yeast ERG3 gene. Accordingly, the authors hypothesized that 
stel-1 possesses a mutation in the sterol C-5 desaturase gene. They isolated the 
Arabidopsis C-5 desaturase gene through heterologous complementation of a yeast erg3 

1 5 null mutant with an Arabidopsis cDNA library (Gachotte et al. (1996) Plant J. 

9:391-398). Finally, the partial human cDNA for the C-5 desaturase has been identified 
by Matsushima et al. (1996) Cell Genet. 74:252-254. Alignment of the sequences of 
these enzymes revealed that C-5 desaturases from different organisms are highly 
conserved in overall sequence as well as in specific domains. The overall amino acid 

20 sequence identity and similarity among STE1 and ERG3 and the human ortholog is 38% 

(50%) and 35% (47%), respectively (similarity within parentheses). As indicated in 
Figure 6 and Figure 7, key domains including the transmembrane domains and the 
histidine clusters are well conserved between all the C-5 desaturases. 

Closely spaced histidine residues, HX 3 H in helices, serve as typical metal binding 

25 motifs in many proteins (Regan (1993) Annu. Rev. Biophys. Biomol. Struct. 

22:257-281). Shanklin et al. (1994) Biochemistry 33:12787-12794 showed that three 
membrane-associated bacterial enzymes, fatty acid desaturase, alkane hydroxylase, and 
xylene monooxygenase, possess eight histidine residues that are conserved in three 
regions dispersed in these enzymes, HX (3 _ 4) H, HX (2 . 3) HH, and HX (2 . 3) HH (where X stands 

30 for any amino acid). DNA constructs containing site-directed mutations at any of these 
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eight histidine residues of the rat A 9 desaturase failed to complement the yeast mutant 
olel, which is defective in the same enzymatic step, suggesting that the individual 
histidine residues are essential for the function of the enzyme. On the basis of these 
observations, Shanklin et al (1994) Biochemistry 33:12787-12794 hypothesized that the 
5 histidine clusters conserved in these enzymes constitute new structural domains of diiron 

binding centers (Shanklin et al (1994) Biochemistry 33:12787-12794). Gachotte et al 
(1996) Plant J. 9:391-398 first recognized the conserved histidine clusters in STE1 and 
yeast proteins. We confirmed that the motifs are highly conserved in STE1 and the yeast 
ERG3 enzymes with the same context of HX 3 H, HX 2 HH, and HX 2 HH (Figure 7), 

10 revealing the presence of a putative iron binding motif in A 7 sterol C-5 desaturases. 

More direct evidence of metal ion involvement in A 7 sterol C-5 desaturase 
function was obtained by Taton and Rahier (1996) Arch. Biochem. Biophys. 
325:279-288. These authors discovered that the enzyme prepared from maize 
microsomes is inhibited by cyanide, whereas it is insensitive to carbon monoxide, 

1 5 indicative of the involvement of a metal ion, presumably an iron, for the proper function 

of the enzyme. Furthermore, we noticed that the typical histidine moiety also was 
conserved in a different group of oxidases such as RANP-1 (Uwabe et al. (1997) 
Neuroscience 80:501-509), C-4 methyl sterol oxidase (Li and Kaplan (1996) J. Biol. 
Chem. 271:16927-16933), and aldehyde decarbonylase (Aarts et al. (1995) Plant Cell 

20 7:21 15-2127). Occurrence of these histidine boxes in a wide variety of oxidases indicates 

that this domain plays a common and essential role in the function of membrane oxidases. 
Therefore, it is likely that the mutations in dwf7-l and dw/7-2 would be deleterious to 
protein function. The premature stop codon in dwf7-2 would eliminate all important 
known domains, whereas the third histidine box and several amino acid residues that are 

25 100% conserved in the C terminus of the protein are eliminated in dw/7-L Intriguingly, 

the location of the mutations in dwf7-l and dwf7-2 seems to be related to the phenotypic 
severity of the mutant alleles, dw/7-2, which contains an earlier stop codon, was shorter 
in height and less fertile than dwf7-L A more precise comparison between the two alleles 
is not possible because the EMS allele, dw/7-2, has not been outcrossed to remove any 

30 background mutations that might have increased the severity of the phenotype of dw/7-2. 
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Despite the differences in severity, both dwf7 alleles are likely complete loss-of- function 
alleles. The resulting nonfunctional enzyme causes a block in sterol biosynthesis. This 
shortage of substrate sterols in dwf7-l and dwf7-2 leads to a deficiency of endogenous 
BRs and causes the characteristic dwarfism in dwf7 plants. 

Thus, novel dwJ7 mutants, as well as methods of using the same, are disclosed. 
Although preferred embodiments of the subject invention have been described in some 
detail, it is understood that obvious variations can be made without departing from the 
spirit and the scope of the invention as defined by the appended claims. 



58 



